FIGURE 3
FIGURE 3A (contd.)
FIGURE 3B (contd.)
TGTG T A T T T T GT T A A T AGT A T C AG G T TG TT T ATT AG G AC T G
i A A G T I T T T AGTT PC A AC C G'AGA A A C AA . , C A A C TTG A*TO
J A A G T f GG T G G : A b TT T 6 A A C: O C A G A A A C A A A C A A C T T G A T G 
'sCAGCVGci PGG : AGC'l CAAC't 'CGAAACA'a .CAACCTAAT^ 
G GCAAT G G T C~ O G C A T I'TAA C C C TG AGTCTAA - -~- - - t|CCG 
GAAGTTGGTGGAGTTTCAACCC AG AAAC A A AC AACTTGATG 

Section 11

411 420 430 440 451

l GTATAGAT A''GAAAGGAACAA ZQ1 ATGT I A '-GCCG A X AAT 
I GTAT^GAT/iT^AAGGGAAGGA ! G/!': TGT7A 3GCCGATAAT 
r6TA f T 'AGMATGAAACGTACTG'; G \ TGT r A^ACC'CA^TAT 

T AGGT - r l CAATACTCCTATCTA- - - ATGp'f CAACA'AT 

TGTATAGAT ATGAAAGGT ACTATGTATGTTAGGCCGATAAT 
Section 12

452 460 470 480 492 



A G A GG A T ; T A C C A T A C A OTA A C AGCC A C T A TC A T T C GTGGT C 
GT A ATT TT GC7 A T A G A — G AG T G*1*G C C AA'T - - - ~k i G X G 
TGAGGATTACCAT ACACTGACGGTCACAATAATACGTGGTC 
Section 14

534 540 550 560 574
GAGG GglCA G TGGVC T^G C - - - - - TAA ATGTGAACC AGACfeAO
561 ) G A r ;< 7 G T G ? T T TT G 7 TG r T r A T G T T A A G T C C A A AC T C
FIGURE 3C
A7AATGGC
7 T A ' f A AT G G*C 

T O G C C C A A T A T A GGAp.C CT AT C G 3? TT-G T T A ATG TG T C TT AT G C C T &T G G A3S G T 
TGGCTGTGTATCGCAGCCTTACTTTTGTTAATGTACCATATGTTTATAATGGC 

Section 10
A AT GiCA A AG C C C GC C T GgJA T^K *££:C AAGA G A C A A T A C T T T A A C A C T C A AT AAC 0 jG
TCTGCACAATCT ACAGCCCTTTGTAAATCTGGT AGTTT AGT CTTAATAACCC 

Section 11
T G i ' A " 1 A T A ' * GTTGCTC - .AG "J A - , C i C fGGGG A: " r- -'A -T fAAi - G T T GAAw
Section 12
j G. ■ ' ; A TAT A AJ'JG v - * ' A , LGAAgGA GA A A / T G 1 I A GG ACGAT - ' ATTA'TC
rAATTTCACAC FAGAA'3* TTGTGA TG ? ATT iAn CAC ; V GGA '■ C r, G GGTT tS* 

ctgatttttatttgtcaggttgtgacgagtatatcgtaccactttgtattttt 

Section 13
A A C G G CA AGT TT T T G - - TCOAA A AG A ------ A AGT ATT ATjS A TG A
Section 14
T AGT G A AT A FT AT I T V A M A A AC 'A G A G T GGTC IT TA'i TJ'A TGGT CT 0 A ATTC^A
TAGT fA.ATATTATTT TA :• r A A AG A C A CTGG TGT T ATT 'I A T A GT G T C A ATT CT A 
CTCT C AGAG T T A C TAT A A1 AT G GA TAT 'i' G G T G T C T T A T A TU GGT T C AATT CGA 
TAGTCAATATTATTTTAATAAAGACACTGGTGTTATTTATGGTCTCAATTCTA 
Section 19

955 960 970 980 990 1007
Section 20

Section 22
FIGURE 4

FIGURE 4A 
FIGURE 4A (contd.)
bovine coronavirus pol 1ab (3777) JitfSttF R M ' ' V V N#;KT.* V QEh R.V M >i A N G l.RP M K-N S FEAL
Human corona 229E pol 1ab (3487) X' P ' c K c 7 1 v ; ' c V * ? a K K f ' v a n c l a < n ' F d a l
Murine hepatitis pol 1ab (3864) L, .0 IP ilM . VJKYKI Vuh I.P v NNAKC L R V RNSF^AL 
Consensus (3940) I NS I FRMTLGVYN FKI !sVQELRYMNANGLRPPKNS FEAL 



PP20480.019 



31/199 




Section 103

(3979) 3979 3990 4000 4017

avian infectious bronchitis pol 1ab (3359) S T$ I l^QG rc< ; DRVL 3 FMT v r> a* K b Si 1 ) v K iy w L M<i?LG,T
bovine coronavirus pol 1ab (3816) M l x N ;f'k ilrG I ;0 ^ V i? Wi< E 0*53 Q F Q-d K^T C.v k-0 AN x ? Vi^llb! €?4§&
Human corona 229E pol 1ab (3526) Fx? S :F K !l M & K V{$ ; T V c 2 S • f r >>'L|;s r- 7 V-li Mg ijJ'S 
Murine hepatitis pol 1ab (3903) M£0!*&^^^^ 

Consensus (3979) M L N F K L L G I G G V R VI E V S T V Q S K L T D V K C T M V V L L N C L Q 
Section 104

(4018) 4018 4030 4040 4056

avian infectious bronchitis pol 1ab (3398) M N V E»A r « WMH V* LVE L- ^ K ' LA'S Vt E - I^G^ti 
bovine coronavirus pol 1ab (3855) HKH^ftsVif c'UWQ" csrflh^rir, :LATSi LG va FE Kl-AQfiip; 
Human corona 229E pol 1ab (3565) M^feliiS^E WAy,&£zM^>MlttLC ^PET^Q^L^SMilA
Murine hepatitis pol 1ab (3942) £i£H|p^S 

Consensus (4018) H I» N i A S N S K LWQYCVTLHNKI L ATS DLGVAFDKLLQLLI 

Section 105

(4057) 4057 4070 4080 4095

avian infectious bronchitis pol 1ab (3437) tMP^i dIItSS- - - - - -ih ey'c; ^'DX^Ki^ tvj->SMT ;s>;s 

bovine coronavirus pol 1ab (3894) gj^^ggp^ 
Human corona 229E pol 1ab (3604) FF £|1k hSd F G - — - #G'b L v . Y ^ SI; ii^iJ^^AjS, 1 v
Murine hepatitis pol 1ab (3981) y&y A K i ' AA-V li S K^A's^fi.E V I ; - D.i VRD N T V ( AT 0 1 > l v
Consensus (4057) V LEAN P AAV D S KC L SI EEVC DDYLKDNTVLQA'LQ'SE FV 
Section 106

(4096) 4096 4110 4120 4134

avian infectious bronchitis pol 1ab (3470) tip y\Fl * \ RfVKM I <7L V V~ KN\ GVTgp*" d/AA Y N /

bovine coronavirus pol 1ab (3333) n - p v T ^ i >v v * k k nt.d k ^ p ft s r; s >: S^^^ic^feE;KKG

Human corona 229E pol 1ab (3637) C,U p\s f.v A - % T ^OB^fc^X ;j G ^ 5 PQ I IKO'i-;; i AM 

Murine hepatitis pol 1ab (4020) NMA&F*V E V - L. i X K t-J L D E A K A S G S ^ M QvQJ KQ L E - ,C 

Consensus (4096) NMPSFVE YELAKKN YDEARASGS AH 0 Q Q I KQLEKAC 
Section 107

(4135) 4135 4140 4150 4160 4173 

avian infectious bronchitis pol 1ab (3509) x>"i :\:.s 'TD-dla^qk- lds ER; M f r *. «< -v 'Drra 
bovine coronavirus pol 1ab (3969) M.-> 5J a|y*E ! - u ? A r A? . LLR> . D I L j l\ '\ - I M D K K S
Human corona 229E pol 1ab (3672) 

Murine hepatitis pol 1ab (4056) t i ./s r.Y E" dp a AW^'fl?^^f.' ttk' j . < ::id'kks
Consensus (4135) N I AKSA F DRDRAVQKKLERMADLALTNMYKEARI n dkks 
Section 108

(4174) 4174 4180 4190 4200 4212

avian infectious bronchitis pol 1ab (3548) f^fH 5 L C s L K K J • KLK V r, i)0\' '/VV r _-A V 
bovine coronavirus pol 1ab (4008) ^^S^SS^Hti^S'iVR n;QA-Lh'S ILrN^VKVC^ ^LNAf 
Human corona 229E pol 1ab (3711) ^j^^tH®SG8fe^ I«KK L MiSV^flLMl ir^v; ' - _ *, 
Murine hepatitis pol 1ab (4095) R«V^§ ; X|l(@gj, s^vi « , QA-iiMSfLF^ vk v c'v-nai
Consensus (4174) K V V S A L QT LL F S M I*R K L DNQALN S I L D N A V K G VV P L tt A I




Section 109 

(4213) 4213 4220 4230 4240 4251

avian infectious bronchitis pol 1ab (3587) &L}fv S N 1 I v-hVJy o P§wWs7^cy£; t VH i ST y/v v- h ; i ry
bovine coronavirus pol 1ab (4047) &s|^ 
Human corona 229E pol 1ab (3750) £at|^^ 
Murine hepatitis pol 1ab (4134) t?&j^^ 

Consensus (4213) P S ETa N T L T II V P DK D V F V Q V V D NV Y V T Y A G V V W N I Q T I

Section 110

(4252) 4252 4260 4270 4280 4290

avian infectious bronchitis pol 1ab(3626) I A 0<J$«-;Lfc P j ^Tc, SGLT YC I SCAN I A w t ^(gNSkju-Hrb 

bovine coronavirus pol 1ab (4086) ,q LV^DCrf §£®Mp bc|j^|^^I A&kfcE 

Human corona 229E pol 1ab (3789) K?-Kiu~»xv' : :f, K DtVT.K Kn||ei1|w Pj| I LTcli 

Murine hepatitis pol 1ab (4173) ^S^i;AVggS^$ : DV -KfsvW f T ^f&&%#W& 

Consensus (4252) Q D A DGT N KQL-N E I S U WW PL VI LNRHNE 
Section 111

(4291) 4291 4300 4310 4329

avian infectious bronchitis pol 1ab(3665) NKV;DVVLQ , l L HGyfl r.KAC vjgtav D^3gHg SV E^K^SjT. 
bovine coronavirus pol 1ab (4116) yJ^A T y l < Q - I . LL 'B|cm^^^y)^s]«» jS P £> Q T I^T^IT^^gf
Human corona 229E pol 1ab (3819) rvvIkSq- -||$|s^

Murine hepatitis pol 1ab (4203) V3\ WI^Q'-i .5 r L M PQKLRT Q V V NSHSSgNSN^Nf 

Consensus (4291) VS V VLQ NkVlMPAKlTTtQVVNS G DA CMTPTQCYY 
Section 112

(4330) 4330 4340 4350 4368

avian infectious bronchitis pol 1ab (3704) V n> I s q * V v a . . T s . V F , 1 ; v < I, ; e a m q by V p D *
bovine coronavirus pol 1ab (4153) Ni-fgjN' ng K l v v VlsdVdVl* ytk; lkd: xfvvlii d'-r
Human corona 229E pol 1ab (3855) NN GGRAF|1X/ :Y ^ T ; TK ^ GM ^ Y ^WE D - VV 1 VE gjl ; 
Murine hepatitis pol 1ab(4240) N ) TGTGKI v Y ; i.SDCDGi; y.t?:tv?:ed kcvvle b>; 

Consensus (4330) NNSGGGKI VYAILSD PG L K YT K I LK.DDGN VVLELDPP 
Section 113

(4369) 4369 4380 4390 4407

avian infectious bronchitis pol 1ab (3743) .K VGVKgryv * i : iWKT'lSI M L^ATS :v\ V 
bovine coronavirus pol 1ab (4192) K' T-VQJV.-;olk:k s r'VK*.CS?:;r V: 'PC5SY-H 
Human corona 229E pol 1ab(3893) " P^F vj|r: TP 7© P C IK^ t v ,nl\ ,lr c a j, yi.,at\ h 
Murine hepatitis pol 1ab (4279) CK;§4/*Q v DV r |§| ^ '^^CNT ^ A • V TLSST/H 
Consensus (4369) C K fsVQ DVKGLK I K YL YFVKNCNT L A RGWV LGT I SSTVR 
Section 114

(4408) 4408 4420 4430 4446

avian infectious bronchitis pol 1ab (3782) |fc§KGHEl : SE^:>AV I S 1 S V A . A T c ' i'VAAO:Q 
bovine coronavirus pol 1ab(4231) *. >-AG-'l A V ^liSSlLS'L; Sl^S KK7' gBtgSpQ'-GT 
Human corona 229E pol 1ab (3932) .AG- 0 F V,S K S Hi; LT ? H-- S A : AA LbAVKQ.M 
Murine hepatitis pol 1ab (4318) ";Xc ™ TA Y. 3 1; 5 AlV, S L J Ai* S • $ LDYKkOkvGV 

Consensus (4408) LQAG TATEVv'SNSAI LS LC AFAV D PKKT YLDYIKQGG 




Section 115

(4447) 4447 4460 4470 4485

avian infectious bronchitis pol 1ab(3821) \0G n i ^'t^TVHHv^L < A i ^SKPJPT ? I .vs*, '-33V0g 
bovine coronavirus pol 1ab (4269) j/i A ; C T> H A 'i v M " : VK r DAT ; * 0 . \

Human corona 229E pol 1ab (3970) fvgff . ,i*j-t. GSvS. 0 s 'c !DS^r, " T x ^ * , - I
Murine hepatitis pol 1ab (4356) ^vtmc v 4mx-c d_h a t m ; m l k ^fai^n-q^S'. *"^^Vc;i 

Consensus (4447) P V GNCVK M LTD H A G S G H A I T I KPDATTNQDSYGGASVCI 
Section 116

(4486) 4486 4500 4510 4524

avian infectious bronchitis pol 1ab (3860) *• -A- IA • G3VG L P 0 K>3 F .1 'TEK:\v;;FC
bovine coronavirus pol 1ab (4308) j ;ARVE;| : - ~~ - DY' * *V LH * FF 4 V VGfffiS^S YV> 

Human corona 229E pol 1ab (4009) ' - A r V A 1 / T M • ■ 0 V K « K-W - V - i G T s *w i r f ( C

Murine hepatitis pol 1ab (4395) A< s SRVE ■ DV M.- 3R KF ;v LG I K. - VSYV 

Consensus (4486) YCRAR V E H P DVDGLCQ L K G K F V Q V P I G X K.DPVS F V

Section 117

(4525) 4525 4530 4540 4550 4563

avian infectious bronchitis pol 1ab (3899) s pn* . T" QC ;i. Y V- D 3 L P Q P KSS V Q S V \ G* A 3 t . F f )
bovine coronavirus pol 1ab (4343) ifHD, ^QiffGF i^DGS ^|fg|1|D V: VQSK.-
Human corona 229E pol 1ab (4044) 'LENT « GC LMi, T DHI A % Q S b 

Murine hepatitis pol 1ab (4430) . 7 : r g G F ' 3 OC- s -S V r u - - S Q ) Q 3 K|l: 

Consensus (4525) LTKDVCQVCG F W E D6SCSCVST SAIQSKD 
Section 118

(4564) 4564 4570 4580 4590 4602

avian infectious bronchitis pol 1ab (3938) UY ' s 3 - F K,! VLnJ "J V7K VC K E 5
bovine coronavirus pol 1ab (4373) "l llV TSVD P 1 V 7 L' A 3 :. '! VOL IC A 1
Human corona 229E pol 1ab (4072) ' ' Y ■ 3 - . APLE? < T ^ I Y C V ■ : V Y J K D A
Murine hepatitis pol 1ab (4460) K F '? 3 V , A L v r C A 3 3 - r v 3 : - 1 C A R
Consensus (4564) T M FL M R . V R G S S V D A RL V PC ASGLDTDV(QLRAFDICNKDA 
Section 119

(4603) 4603 4610 4620 4630 4641

avian infectious bronchitis pol 1ab (3976) a g k f q n l ?. : j> 3 d v f : .ace y l s y f y ,;J|t p

bovine coronavirus pol 1ab (4412) AG l G L ' L v > ir<v:F Gi> v<h:\ fVv'vVetdS. 

Human corona 229E pol 1ab (4109) S : FIG K M L . S V t v r) , D A F Y .1 ■ R C I K

Murine hepatitis pol 1ab (4499) A GIGI Y Y 7 C . Q K V ") F : )G K A , F F V RTN |

Consensus (4603) AG I G L ^ L K VH CCRFQRV d'e dgd kldaffvvkrt l 
Section 120

(4642) 4642 4650 4660 4670 4680

avian infectious bronchitis pol 1ab (4015) 3NY ' V . KSe F 01 - " - FV , D; ; F > K T VN T 3 - 

bovine coronavirus pol 1ab (4447) T I V K • M r. £ RV DC K fv F :) r V F 3 V E G C RV P IV
Human corona 229E pol 1ab (4140) SVK H SH ' 3F C:3 V. VW FGRTTYGNVG 

Murine hepatitis pol 1ab (4534) v y :a k C * f " t E C g v v f e t f d v f g 3 p vp; ; v
Consensus (4642) SVy'nHEKSC YELLKDC VVAEH DF 'FT FDVEGS RV PN I VR 






Section 121

(4681) 4681 4690 4700 4719

avian infectious bronchitis pol 1ab (4050) |j!Uiih^

bovine coronavirus pol 1ab (4486) kdi.t'K YTrfLl>L/.: h' Ki i ;k r C RN t) r; MjbX C'D i :,S I^G'CEQS;
Human corona 229E pol 1ab (4179) j^b^tf^ 

Murine hepatitis pol 1ab (4573) kbi^Kf^Klj fi lit: V. ; ftjft F riR/fn§ ; r LK<E TL^pf^EgfegS 
Consensus (4681) QDLTK YTMLD LCYALRH FDRN DCEVLKE ILVTYACCEDS 

Section 122

(4720) 4720 4730 4740 4758 

avian infectious bronchitis pol 1ab (4089) iijPKW fee; . D. V. PI KYYVMtAKMuPTMPR^i^KAlj 

bovine coronavirus pol 1ab (4525) |f v T m!$m Y i} fy I K-P-ni I MV y'|3k^L\* PX F ft Rm1^: :

Human corona 229E pol 1ab (4218) Y~ - - - FF M ^ <v*F \ ? 1 £ I) t^R^AA^K^AK Kt4fe;K<:I|

Murine hepatitis pol 1ab (4612) V f; K - r /v fv ;:>?nT -l ; NVY K k I-C P J FN F> l/£g.TA 

Consensus (4720) Y FEKKDWYDPIEN PDI I N VYKKLG P XVNRALLNAI 

Section 123

(4759) 4759 4770 4780 4797

avian infectious bronchitis pol 1ab (4128) k L?jvi'EK r v ; vi '< '..r V .5 ;n k }>y o u ^QKT-A^rc: A
bovine coronavirus pol 1ab (4560) l^iS^^ 
Human corona 229E pol 1ab (4253) AK'DEHHK'; v vi. lw*Cf\ i r-M ryi.cp'r',M

Murine hepatitis pol 1ab (4647) rADAL EA i. VL T r ; h *r : ^. YGi.;>Sgd. ' ^FVKT7R£e 
Consensus (4759) E FA D L V EKG I> V GVLT L DNQDLNGK FY ID F GD FVKTAPGC 

Section 124

(4798) 4798 4810 4820 4836

avian infectious bronchitis pol 1ab (4167) - V p V r ? T * * 1 > II T A • APV Rj^gF* YD- VH K'p-Y:p|; 

bovine coronavirus pol 1ab (4599) .V'vAJ A^rr S tf]. T,/iCHA; F Ly VN ?N^^L

Human corona 229E pol 1ab (4292) G^yTcts^ ^? - VMGC' : 1VC IAS tcl|K5gl FG§>t^| 

Murine hepatitis pol 1ab (4686) i*^AV^AT)3 - hi T» r CKA nr. • :iyN- : GTr%B 

Consensus (4798) GV P V ADS yVs YMMPMLTMTHALDS E L FVN D HAY K S 

Section 125

(4837) 4837 4850 4860 4875 

avian infectious bronchitis pol 1ab (4205) Y!, m/ - v ;e. QZ Q > Y \i ;'iQE-^i?lCR|^ S^CkR.^ 
bovine coronavirus pol 1ab (4633) F * v£ F DY :.E n * H'.SMP \ ? MT V-DV Q **- D R 
Human corona 229E pol 1ab (4331) K h? . V EH KV U Y - * j CVD K- j|j v ir 

Murine hepatitis pol 1ab (4720) F* .VQ F D- ,LE..- f . ' H .s MT *i;T ( C ; Ex E J DR,',
Consensus (4837) F D L h Q Y D F TO H K L E L F U K Y FKHWSQDY H P N TV DC DDRC

Section 126

(4876) 4876 4890 4900 4914

avian infectious bronchitis pol 1ab (4244) L 1 I A Z C: T M Q . ' ' . N C / V V , > / F I ;« T C
bovine coronavirus pol 1ab (4672) i j h ' A.n * 1 i g;«:vl w c; ] - p V: Q*i ''V:,^' 1: fvv's I L 
Human corona 229E pol 1ab (4370) fpfi^Mu *- T AT r I M ' ? ri g i C g K V I . . V if T A
Murine hepatitis pol 1ab (4759) : T < A ' «" 1 S M V L ^ C _• , ? _ VHQ I - v;,» r * ^AMdJBX 
Consensus (4876) II HCANFN I LPS TV I PNTC FG PLVRQI FVDGVPFVVSIG 






Section 127

(4915) 4915 4920 4930 4940 4953

avian infectious bronchitis pol 1ab (4283) * HS-KEI f, J I M-K/Q Q NTMS F S K Jtf C.- Ui- Q k-MQ-fc V G S iwA L L~f G Tl 
bovine coronavirus pol 1ab (4711) ^^E;£GJD^
Human corona 229E pol 1ab (4409) Y ; Rji^ 
Murine hepatitis pol 1ab (4798) T^fS^ 

Consensus (4915) Y H V K E L G V V M N M D V D T H RY R £ SLKDLLQFVADPALHVAS
Section 128

(4954) 4954 4960 4970 4980 4992

avian infectious bronchitis pol 1ab (4322) SjwN.uVl.V 1 - . >. ' ^ ^Sj&l^H gT^&F&K s?» xf>|§;fe A 
bovine coronavirus pol 1ab (4750) AShLYI'L- .Cv C A ItW'KFQVy^^^ -iQP' 
Human corona 229E pol 1ab(4448) IjPA V-K ■/ ' V A LST'JLTSvl ' ... r-nVV-'E m>£ 
Murine hepatitis pol 1ab(4837) ^SA 1 ; lA C ' " A « 1T,SGVKF u'iH^ sr-yj^F^^?^gF]E 
Consensus (4954) ASA L V D L R T C C F S V A A I T S G ' V T F Q T V K P G N P NQDFYDFX 

Section 129

(4993) 4993 5000 5010 5020 5031

avian infectious bronchitis pol 1ab(4361) E K AMSi^Kp^Sft F >Y QTr;N r -> ? ^ r. 

bovine coronavirus pol 1ab (4789) i; S K^fiKI - SSV D » " TFT ODJ N O^yWy^ YI Lt 1
Human corona 229E pol 1ab (4487) R^Q&F F&Ei£ **« L ! FT . < f h'!> B R 
Murine hepatitis pol 1ab (4876) LS;K^LLK'*' / s v rr FT^ D K;V\„,f itr KVKLrv
Consensus (4993) L S KGL LKE G S S V D h K H F F FT Q D G NAA I T DY NYYKYNRP T 
Section 130

(5032) 5032 5040 5050 5060 5070

avian infectious bronchitis pol 1ab (4400) f * F :C'!^MCU Sir CC L 5 ?A 5 J / v VN J L D >
bovine coronavirus pol 1ab (4828) M,V^t-*KffSS;FVLE WK , - hi I IX < ' : ; P^SO^I "N?iYD. :■
Human corona 229E pol 1ab (4526) piV - Gi>ARV~ Y , • A'R . 'C % C , Vu'*< *. S*R r .VV : ^h:V4
Murine hepatitis pol 1ab (4915) &V ; K T. T, TV L c v K * IF. ; P r , 

Consensus (5032) MV D IK Q LLP V L £ V V AKYF EI YEGGC I P ASQ VI VNNYDKS 
Section 131

(5071) 5071 5080 5090 5109

avian infectious bronchitis pol 1ab (4439) \,>Y >&- :< .P 'MS- " ^ L F<Z I r ..KY7; PTi;r

bovine coronavirus pol 1ab (4867) . Y Y , : F .ALSr , hlYAY »<iv V . vjl:

Human corona 229E pol 1ab (4565) ^w' , * G SISY AiFSL'JSRLI CLM" 

Murine hepatitis pol 1ab (4954) . Y * K , - R A I. S F " r - Y A Y -R." V . : L ;

Consensus (5071) AG Y P FN K FG K AELYYEALS FBEQDE I FAYTKRNVLPTLT 
Section 132

(5110) 5110 5120 5130 5148

avian infectious bronchitis pol 1ab (4477) gk ! . A <;ti A - I S .FQ^^!;'i:'. .

bovine coronavirus pol 1ab (4906) . A :3. . . A.; ^ ! : hi v G m^:h.;.kc\ /,i 1 

Human corona 229E pol 1ab(4604) :L ; :.> \ "y ' G* , : L I A j VT- f Q' :'j :KC' xr ; 

Murine hepatitis pol 1ab (4993) -, : K ^ , « ' ; YA A. ^ S j ' G v.M \M , KC 

Consensus (5110) QMNLKYAISAKNRARTVAGVS I.LSTMTGRQFHQKCLKSI 




Section 133

(5149) 5149 5160 5170 5187

avian infectious bronchitis pol 1ab (4516) §^N : $i< N A s V^%-£;i^F : Y p ;N ■ r^'iiR-M -f ^» 

bovine coronavirus pol 1ab (4945) aa . ttYv?> 'C ?'l T t . * o ''C * > - M L RRl>LK p V D ^3 ?y .J^tpRTD 
Human corona 229E pol 1ab (4643) VA r « tr'i «' r - ? vCGi-N\N>t Ki i MAD r pDPK > >-\o 
Murine hepatitis pol 1ab(5032) K A' 1 :; tVJyVp?-/ V fc ~ i *f .»*" \ *M RRf iKD?D; :- VI M VHJ 
Consensus (5149) VATRNVP V V IGTTKTY GG WD N M L R R L IKDVDD P V L M G W D
Section 134

(5188) 5188 5200 5210 5226

avian infectious bronchitis pol 1ab(4555) Y i?.KC r* v-* r i>l I 1 A A S f> V j, A R » >.T> ;CSWSER1 ^ H Y1n 
bovine coronavirus pol 1ab (4984) Xl'^C'^.A^Piri LM'I vs ShV GnRKr(fe;/ S0S'f)R-F 
Human corona 229E pol 1ab(4682) < r\ :MTi M L 5> A M^I >GS*> ivt^: TASVKF' S-i; 

Murine hepatitis pol 1ab (5071) :* > a ^ u J L V^v^rt^r ^R^>{ b S " / s H^DR'Hi.V r > A M
Consensus (5188) Y P K C DRAM P N IL R I V S SLVLARKHDSCCS SDRFYRLAN 

Section 135

(5227) 5227 5240 5250 5265 

avian infectious bronchitis pol 1ab (4594) rJ[^^i^T*^k^^^I^fi< E ^ I S < if- j ^Y;- *2 : i f w \
bovine coronavirus pol 1ab(5023) thr j IS T MZd cy f\ . ^ 3. f T ^L:ir * :\T 
Human corona 229E pol 1ab(4721) fcLAoVi T'LV* Y.^ JSf z .t F- r * T * O AY/' — , 
Murine hepatitis pol 1ab (5110) c r ' ' s * i mcg:;oy ;v-r« . J * ' P^r^> ' >>:
Consensus (5227) E C A QV LSEI VM C GG G Y Y V KPGGTS SG DATTAFAN S V F N I 

Section 136

(5266) 5266 5280 5290 5304 

avian infectious bronchitis pol 1ab(4633) ic * ':a: \ - Ri|?|$jl T RD%V YD\ I KS VY. " 1 ;QV irRV 
bovine coronavirus pol 1ab (5062) c.Q A V ^ A i< VCAL M s c'n;gnM?I % £ biiS 1 HA - - ~K v:3 rv • S> 
Human corona 229E pol 1ab (4760) F Q.f\ y "J- "s , L K VL ; S V^S S NC l\ : r-J r^;%=r ^/vSp C NCY^.L-:s 
Murine hepatitis pol 1ab (5149) §<S^^^^SbS^^S H # E D ?:> s . I R;"-' / . ',: K P 1 ^ s.N v.;; llgg
Consensus (5266) CQA V SAN VC ALLS VNG K I EDLS I KA LQ KR L Y SMVYRAD 
Section 137

(5305) 5305 5310 5320 5330 5343

avian infectious bronchitis pol 1ab (4672) NF PA/' F Y C — ' L : -

bovine coronavirus pol 1ab (5101) mv . T E i : a F'" M r H : ; 3 ; • ^; s DY c K " 

Human corona 229E pol 1ab (4799) NV^C'S- - f> F . G Y . Q il < . , ' ; ' TY- L 

Murine hepatitis pol 1ab (5188) . ; V P A i \ SE Y.EF i. ^ K H F . M I * G - ( • / S K FjvSfKC?

Consensus (5305) MVD P A F V S EFYSF LNKHFS MM J 1, SDD GVVC YNST Y A S KG 

Section 138

(5344) 5344 5350 5360 5370 5382

avian infectious bronchitis pol 1ab (4711) LV.Ao{Yfei 1 V . J UV I i ADR v^pasK- r -- i<v : 
bovine coronavirus pol 1ab (5140) YIAH : , A \ ugv . " ? - LIES P/. HM", ' . ^

Human corona 229E pol 1ab (4838) Y„i ^ A" .A N STA . iEP^x.- . 1 *

Murine hepatitis pol 1ab (5227) YM" A f 0 Z V 1 . t: y ' * * S hA / V • T . I'EK V >i \ , 
Consensus (5344) Y I A N*I S A FQ Q VL Y Y Q N N V F M 5 E A K C N V E D I E KG ?H E FC 






Section 139

(5383) 5383 5390 5400 5410 5421

avian infectious bronchitis pol 1ab (4750) - W ^H ^MLV'-J/ DGEPKi LI ri'i)rs-H r^G^'CjV-^V n^D^L^ > 
bovine coronavirus pol 1ab (5179) l&a*^ 
Human corona 229E pol 1ab (4877) ?J^i^\^6|jvD|j^ ^ K Y : hh^f % < r,.\H> : V<P G\jr\t by^k4f^M
Murine hepatitis pol 1ab (5266) ^riT&i^ 

Consensus (5383) SQH TH L V K M D G D D V Y L P Y P D P S R r L G AG VFVDDLLKTDS 

Section 140

(5422) 5422 5430 5440 5450 5460

avian infectious bronchitis pol 1ab (4789) <Ay f M- Y 1 A\ - j ^ " _ ~ VH'iBNSif^ K * F*L„A^l]|&£ 
bovine coronavirus pol 1ab (5218) M L Ij?Mi \\FySt..M ^.-.V^LV^rSNE. ;QK rRVY^l EYIKK" 
Human corona 229E pol 1ab (4916) ^K^^^^' 1 "* 1 i,:K ~- f i? ^sVkPKgfe i ? kk ? Y a L kDwylK'^ 
Murine hepatitis pol 1ab (5305) ^£1; !?;F^V Sj- i «y. y :~ v : YH E Ng!r X-Q'^^t^RV^S^l^^^ 
Consensus (5422) V LL I E RFVS LA I D A Y P 1*V Y H E N P E i Y Q K V F R V Y L E Y I K K L 
Section 141

(5461) 5461 5470 5480 5499

avian infectious bronchitis pol 1ab (4828) Y~£nv qnm r M jy^ F^j^u X DK.^^^|^'w^l$Mf^^RApSt 
bovine coronavirus pol 1ab (5257) |#)^^^^^^§^to^^^Ql^Tp £s ; kn <i :lp s Cv ; 
Human corona 229E pol 1ab (4955) NK ; r^*gG^^^P^T^LbE^E;s I \WDF,S.* r A * Mj^S§M :
Murine hepatitis pol 1ab (5344) gMzW^&i c s Y 'VIL ST CDC/ ; ~ T D E t ' k n Y i£g§; ^

Consensus (5461) Y N D L G N QI L DSY SVILSTCDGSKFWDES PYKNMYLRSTV

Section 142

(5500) 5500 5510 5520 5538

avian infectious bronchitis pol 1ab (4867) L$s C ! J vn.v.vi rr.v^r r (,!< c;v fc U': I i>.K-r *\Cr Jc : pk^S(h> 
bovine coronavirus pol 1ab (5296) M' s.v k . c . S I ha c c* * i Vjgg^ 
Human corona 229E pol 1ab (4994) 1,. A.-. I» " rt^r i£ V : l : rtfi R ; m\ T K ; i; \ Ag| 
Murine hepatitis pol 1ab (5383) SV A S \ J . . s . l . K ' i. r . : ys, 
Consensus (5500) LQS V G A C V V C S SQT S LRC G SC IRK P LLCC KCC YDH V MA T 
Section 143

(5539) 5539 5550 5560 5577

avian infectious bronchitis pol 1ab (4906) f>\- s V - V" ^L0iO£A . 1 : ".MSiTCG - >
bovine coronavirus pol 1ab (5335) t I * V l s.vs ■ V " N ^ „ D v / 1 • K3 t Y ED * 

Human corona 229E pol 1ab (5033) D'Br V \ : AIT' v. NT :* v:c : h y. vdim 

Murine hepatitis pol 1ab (5422) ''^fcYj/i^j^ 1 >*V.M7sr* riVK -* '^:m.S Y ED- . 

Consensus (5539) b H K Y V LSI S P YVCNS PGC 6 V N DV T KLYL6G MS YYCSDH K 

Section 144

(5578) 5578 5590 5600 5616

avian infectious bronchitis pol 1ab (4945) pf l£ P VSJ: TJv : * i t RA . C - . - E N>v D> LA?T>4fgl 
bovine coronavirus pol 1ab (5374) Q,Y>>K ! VMN V|-J7 j-? ^VKQS CT, ^PY-r D P r :»*R X A S c; k\ * T
Human corona 229E pol 1ab (5072) ^{fL^F P CSA N^VgL K.S- l: -HDIDv'-kl'ST- *.JS 
Murine hepatitis pol 1ab (5461) IDMSBK ^ 

Consensus (5578) PQYS F PL VS N6MVFGLYKQSCTGS PYIDDFNK I AS CKWS 






Section 145 

(5617) 5617 5630 5640 5655 

avian infectious bronchitis pol 1ab (4984) 1 vjj,P * , A A fCSDSlkR! 1 <Ja kvTp»^I>HF Q^f^S^E^ 
bovine coronavirus pol 1ab (5413) DV^jj^^ \, Kh ?V AE i*Q3KiiT I \Q S V-\ S A1VI 
Human corona 229E pol 1ab (5111) B|jRp^K :fi D/, [< ES_L RL FAA AA\vjA i*K g g^Vfes S^^yS:^!
Murine hepatitis pol 1ab (5500) p^f^f^Ktl|^Of fi§ £ ? i?T*>iT). ir:-3*"Qic^^: ^S"^' -it Q- f --5?;?V sy>V*4 s ^p 
Consensus (5617) DVDD Y I LAN ECTE S L K L F A A £ T V K AT EE AFKQ S Y AS AT I
Section 146

(5656) 5656 5670 5680 5694

avian infectious bronchitis pol 1ab (5023) R uyjFS DR» • : I : sv; v pa: . 1 R 7 r. _ r l >; y * GYliF <-R ; S>w 
bovine coronavirus pol 1ab (5452) U^xlv%gETL I * ijVuGi^VK v L<„\Ki Y ^ t 1 GYLV:KNCr\T 
Human corona 229E pol 1ab (5150) K f V\'A : < ?¥.'\ ' liLL-Vv */s<rl\A*jK K %fc ,*R-i s - .r;f v ^ K::<§jkr 
Murine hepatitis pol 1ab (5539) RfivsDA - < S > 1 1 A: :<VR7Ai \K : ':YHF~ilNC:4jr

Consensus (5656) REIVSDEBLILSWEIG K V K P P L N KN Y V T G V H F T K US K T 

Section 147

(5695) 5695 5700 5710 5720 5733

avian infectious bronchitis pol 1ab(5062) Q'UlMFfrrE? > EG\;v-^Y;, KA \ s\: A V ASV DI«'V:i ^SA-N 
bovine coronavirus pol 1ab (5491) 
Human corona 229E pol 1ab (5189) F, FV E v rxYGS DT^Ttfp^t avt h .,v p . hi; j ; hYiN 
Murine hepatitis pol 1ab (5578) ^llggf §K si^N% - yi5f,3R A ; ' T 7 v s v s v 7 1/ aaa .-A
Consensus (5695) V L G E F V F 5 K S E L T S G V Y Y K A T T T Y K h S V GDVF f LTS H N 
Section 148

(5734) 5734 5740 5750 5760 5772

avian infectious bronchitis pol 1ab (5100) WS . V . ; LCP O Q T F S R F © R »? N V I \ F-CKV -X'l^LVT 
bovine coronavirus pol 1ab (5529) < A ' . 3 L v 7 P 7 - X Y S S I R ' A S V Y s ; 7 i = A ■ N V V stj I ^
Human corona 229E pol 1ab (5228) A ,R M K UYSllYKLlhSF' < D Y A lvrV^ 
Murine hepatitis pol 1ab (5616) "ss s . . lv? k ~ A Y 7 s J H 4 A s v Y » f, t fq n-v p n yq 

Consensus (5734) VASLSAPTLVPQ E N Y T S I R LA S V V S V P E T F Q N N V PfcJYQ 
Section 149

(5773) 5773 5780 5790 5800 5811

avian infectious bronchitis pol 1ab (5139) jSgpKg^RT V • , ' S^y<>>\ f <\AC J 1 ■ A ' 'F^S <y.\^r/.C^ 
bovine coronavirus pol 1ab(5567) H T j (, ll ' R Y C J / j ' r*T,;-{! ^ 'i L A r "LA Y .7 T V : A^' 
Human corona 229E pol 1ab (5267) Z]Z kqrtt T.. . < 3 tW^ca* -IC i PC ;i -FA AC 
Murine hepatitis pol 1ab(5654) H'X> M * RYC v r T A A 7 4 ; T, A * - 7 Av ' Y C 7 7 v/bAA 
Consensus (5773) LI G MQRY TTVQGPP G S G K S H LA I G L A V Y Y C T A R V V? T AC 

Section 150

(5812) 5812 5820 5830 5840 5850

avian infectious bronchitis pol 1ab (5178) :'!.*. . A A' 'KFl. >V :p. - 7 QR TID FSKA'P 
bovine coronavirus pol 1ab (5606) r H ' T A : A 7 Y K F L blpUD % 7 v A K v r ' E : Y ; X r A r
Human corona 229E pol 1ab (5306) -* • S A . V i AY./v" / ' A AR R e Y,SG1*K>" 

Murine hepatitis pol 1ab (5693) < ... A A HKfbMTIAO - V AKYF D Y 17"^
Consensus (5812) S HA A V DA LCE K A H K F h N IN OCT R I V PA KVR VDCYSKFKI 




Section 151

(5851) 5851 5860 5870 5889

avian infectious bronchitis pol 1ab (5217) r P TGKK - &?&y.vv k &h&W:SC ! '.< vifctfjV : KTJT ^ r.EfaiP Fl
bovine coronavirus pol 1ab(5645) ^dtVr-kvv-t iu f i mVia i vv j i jA| * aea%V'A 
Human corona 229E pol 1ab (5345) L :r>SA; y WS, 1 V,A i../^VN, 1 * VV ■. -A -AC :::b;. -VI 
Murine hepatitis pol 1ab (5732) ?; bTT R K /V?l I A A ! ,J Lvr ; I V < 1 .L - n k hi . v' 1
Consensus (5851) N D T T R K Y V ITS TIN ALP E V VT D I V V V DEVSML T NY EL S V X
Section 152

(5890) 5890 5900 5910 5928

avian infectious bronchitis pol 1ab (5256) . X \ LY YY .v h-aoi : A : I G-Si ' KDT:V "X 
bovine coronavirus pol 1ab (5684) AA7PAKMY T A, . . v _ L S KGTX E KYFIA 
Human corona 229E pol 1ab (5384) Mi J A I 3 Y K r I V * • o * . V . T 5 - A . M A * T D Y V v «
Murine hepatitis pol 1ab (5771) >.?RVSAKCY' :I ' : A ' ^ 1 ^ L- ACT M. *' P. Y F T * S

Consensus (5890) N A R I S Y K H Y VYIGDP A Q L P AP RY LLSKGTLEPKYFNVVT 

Section 153

(5929) 5929 5940 5950 5967

avian infectious bronchitis pol 1ab (5294) r L* VCVK ' I /..A"-- ' • : D* 'A * . U 1A' N 

bovine coronavirus pol 1ab (5723) KL* CCi.G : ia lGT ! ^ : -K"„ d • A. '^"LBKS 
Human corona 229E pol 1ab (5423) * R c IG . V - H * A . . e . EX= • >VPV-
Murine hepatitis pol 1ab (5810) Ki, cc:.g : . G t - k ■ n a ' . ■ *.n lkakn

Consensus (5929) K L M C C i G P 0 1 F L G T C Y R C P K E I V D T V S ALV YE NK LKAKN 
Section 154 

(5968) 5968 5980 5990 6006

avian infectious bronchitis pol 1ab (5333) V AC viv :GNSDVtHi:s . AY .IT L VK'i-VC

bovine coronavirus pol 1ab (5762) ! VA/KG -V'THhiss AV ^"lYLIAKi L K 

Human corona 229E pol 1ab (5462) r A . Q - I A £ R G S V Q V D Sf P - L V V K R I,

Murine hepatitis pol 1ab (5849) D A~ M < 'AAAG Q THZ3S AV MQ-.HHW KALK

Consensus (5968) EAS SLC FKV Y YKG VTHESSS AVNMQQXHLI KKFLK 

Section 155

(6007) 6007 6020 6030 6045

avian infectious bronchitis pol 1ab (5372) ?* K ' H r " " ' AM qr y m n S 

bovine coronavirus pol 1ab (5798) A PL-HK/Vi ' . : SCA F K V QT . ' A„ ' ; . 
Human corona 229E pol 1ab (5498) K T HK-V SA v; A l CA A

Murine hepatitis pol 1ab (5885) A rs s v ?s ? vv - v a-t

Consensus (6007) AMPS W SKAVF I ^$ P YHSQN YVAKEVLG LQTQT V DS AQGS E 

Section 156

(6046) 6046 6060 6070 6084

avian infectious bronchitis pol 1ab (5411) - Y A v . A DS Q ' ' ATA T . h \ . R LVV- RQR

bovine coronavirus pol 1ab (5837) • V ' Y 5J Q A E T A ' SV V , . ; ' I A L C V ' ; S N H
Human corona 229E pol 1ab (5537) ' Y FAQ'. SUTA AC:.-' .1' -K -Al'3 R

Murine hepatitis pol 1ab (5924) F YSC: A ETA SV V . ! . r- K AAV 0.-M 

Consensus (6046) Y DYV I FSQTADT AHAVNVN RFHVA XT RAKKG I LCVMSHR 






Section 157

(6085) 6085 6090 6100 6110 6123



avian infectious bronchitis pol 1ab (5450) D E I, Y$A X* KFpS ELDSKTK LQG£. IC KKF|F-S

bovine coronavirus pol 1ab (5876) q;l FEA^^^^^J^KMPQ AY E T R : y<^^|ltc^g8:^§^: 
Human corona 229E pol 1ab (5576) tl^^Kf F Sp^T Bil $B</b -CC-Jr* D"C;ARN PI 



Murine hepatitis pol 1ab (5963) QL FES ln.f* t LT'l DKlfj N PJ^^ftr »r i>^SRS'Y.V

Consensus (6085) QLFEALN FT T LTLDK I N RLQC STNL FKDC SKS Y S 
Section 158

(6124) 6124 6130 6140 6150 6162

avian infectious h rori ^ h;fic r> ° 1 1 u**. >• \pk if a»t-^ a • v v * tfp * ~ t ?i - . c 

bovine < 

Human corona ^zye poi lau ^oouy; u l v .* bit - :r- 

Murine hepatitis pol 1ab (5999) GVH--AH PSFPAVDPKYPY . VC L>i .V A V. S - A V * S R 
Consensus (6124) GYHPAHA PS F LALDDK YKVSGDLAVCLNVADS AVTYSR 
Section 159

(6163) 6163 6170 6180 6190 6201

avian infectious bronchitis pol 1ab (5521) L K M <" V ,VE' C H P M " J _*? i> £ i r , g . V F V

bovine coronavirus pol 1ab (5953) L . km " **K PD VT PL* YCKL i KEK VKk PA'AmF-'-,
Human corona 229E pol 1ab (5646) v ' ym: p • t>vsM?/SH - i> C RDF-.MR. ..g l«. : - V.
Murine hepatitis pol 1ab (6037) p ■ ' i. M ' k P u L '; p v C ki, Pi; k . v k r .-a • v « . f

Consensus (6163) li slmgfkldvtLdgyhkl fitrbeaxkrvrawvgfdve 
Section 160

(6202) 6202 6210 6220 6230 6240

avian infectious bronchitis pol 1ab (5560) ||r^CGT P >I V < ~ T A V T P E P V DTS I ON 

bovine coronavirus pol 1ab (5992) Q A ;A TP HS;! F P L - ' 'i i V }• AT P 1 F A DR D. = Y 

Human corona 229E pol 1ab (5685) GA ; . I'GL: : V V L V - V PE :c Vl?NT(';S 

Murine hepatitis pol 1ab (6076) G A A I R 0 S T " F L L T : V P AT M F A K Rp* Y 
Consensus (6202) G AH A T RDS X GT M F PLQLG FS TG ID FVVE PTGL VATRDG Y 

Section 161

(6241) 6241 6250 6260 6279

avian infectious bronchitis pol 1ab (5599) r<r -pvnsk> s.kv s A P I p r. 
bovine coronavirus pol 1 ab (6031 ) S F K K v A K K I ) P x T b G g R L V P r 
Human corona 229E pol 1ab (5724) VVKPVRAR, . I VP LRKG0P ' L K ■ .r:l. 
Murine hepatitis pol lab (6115) vtk-K, : -. AA-r;-.i r P - , : >. k.:_ I pLms.s.gqk'-: d-'V'/.j p '%-: : p;;^l 
Consensus (6241) VFKPV A K A & V> G BQFK H L I P LM S RGQPWDVVRP R I V Q M L
Section 162

(6280) 6280 6290 6300 6318

avi an infectious bronchitis poi 1 ab (5638) A V 3 c V - T H G L T I . V P I -: SQVCS 

bovine coronavirus poMab (6070) A pi i :>! s CV L T A A ? CP V" i^TSCNV 

Human corona 229E pol 1ab (5763) A.. ~ A SS' VL I L AGCL T H x V I'-AVKHCQ 
Murine hepatitis pol 1ab (6154) s H A D I A r V L T A A ;CL V REV V C V 

Consensus (6280) ADH LADLS DCVVI VTWAAGLELT T LR Y FVK ^1 GRE V CCV 






41/199 

FIGURE 4A (contd.) 



Section 163

(6319) 6319 6330 6340 6357

avian infectious bronchitis pol lab (5676) CQpL^:\TiE-k: ^!4^^^i^^iC^?^^iV€l^%^ ! ii^lilS 
bovine coronavirus poMab (6109) cfjK>R^ i -y: ..ft& v g. wr i svTCVYy, J *, t.Vv f»v ..^ h„ 
Human corona 229E po! 1ab (5601) CGTV>.:ei:<;;, v;s!nd) CCFr i ; gc-'.v/v., < : vvra. r&V-'ci 
Murine hepatitis pol 1ab (6193) ojgKR* \ cff >ra'cY:.cj. Vr 'syh'Wl;':. liv/j 

Consensus (6319) C T K R A T C F N S R T GY Y G C W K H S L G C D Y L YNPjL I V D I QQW G

Section 164

(6358) 6358 6370 6380 6396 
avian infectious bronchitis pol 1ab (5715) VS«* V ; F*. DLH -GHA V . . . .IN L/ie 
bovine coronavirus pol lab (6148) ,!< H S3 D L Y S a KG A V.v^s';, / , • VYDC/C 
Human corona 229EpoMab (5840) ' ^js. sy* : A ] ' i' HUE - — .:.v^DC:V 
Murine hepatitis pol 1ab (6232) < f? i ^sfet^P-j -\S * ! KG A — s" - , . '\VHDC\C 
Consensus (6358) Y TGSL Ssk H D L I CSV H KGA H V A SSDAIMTRC LAV Y DC PC 
Section 165

(6397) 6397 6410 6420 6435

avian infectious bronchitis poMab (5754) v - VN • LT • li * A D-.v ssc YL ^MVLW "VDi-LKV 
bovine coronavirus poi 1ab (6187) UVE . I SF \L S<i JfS£ r<VL'I RV ML K -amuCNHY 

Human corona 229E pol 1 ab (5879) m: v - I T ' ~ M ' A' -MAI , . ' G . V 3 1 5 1 M P . A PKLY , P 
Murine hepatitis pol 1 ab (627 1 ) K _ V K M L E* IS V s V T s C F LL/, F^fM J: MllfMfiS N * Y 
Consensus (6397) KM V HW NLT Y P X I AN EL SIN TSC RLLQRVM LRAAM LC NRY 

Section 166

(6436) 6436 6450 6460 6474

avian infectious bronchitis pol 1 ab (5793) N V.V Y . F. G I - KC-'R p. G D V .\; fc R h K K V R V Q & 

bovine coronavirus pol 1 ab (6226) T L ■ : Y ~ - - % i j\ c V K D F B b K £ A | I V K S TL'L 

Human corona 229E pol 1ab (5918) KA1 H \ K- - GI .V DA W.YC -KK -. N TIE 

Murine hepatitis pol 1ab (6310) DV'CY ; GLACVK; ydi- K F A vvks fq; v 

Consensus (6436) VCYDIGHPK A I AC VK D F D F K F Y D K N PI VKSVK.TLE 
Section 167

(6475) 6475 6480 6490 6500 6513 

.avian infectious bronchitis po! 1ab (5831) UY 0 KDKL-A • '■ • M '• C FMFL Y F- 1 

bovine coronavirus pol lab (6262) ' F F A K i > s :< • . X ' - K . P :•: a v F . V L 
Human corona 229E po! 1ab (5955) . DYM G — QM . ~ . L ~ 'S M . E F Si * ,F ^ R 

Murine hepatitis poll ab (6346) ! YEA KDO-- ' K ANAV " F . VI, 

Consensus (6475) Y D YEA HKD FLDGL C M F W M C N V D KYPD HA VVCRFDTRVL 

Section 168

(6514) 6514 6520 6530 6540 6552 

avian infectious bronchitis pol 1 ab (5870) §v F p c K Y I K r J R : s F K A K . * 

bovine coronavirus poMab (6301) ML { J C . K M KPt RAAFE , PK 

Human corona 229E pol 1ab (5992) STL E.OV H ? A Y-D.KR A A, PA 

Murine hepatitis pol 1 ab (6385) ' KL, F C i " K . I . F F T R A A FEN P M 

Consensus (6514) S LHL PGCNGGS L YVNKHA FHT P PF DRAAFEMLK PM PFF 






42/199 

FIGURE 4A (contd.) 

Section 169

(6553) 6553 6560 6570 6580 6591

avian infectious bronchitis poi 1ab (5909) $k d s^R&ET&Q V DC^A'g^>LV'S;i>A-gK DC 1 r^&iG^Vv; 
bovine corona virus pol 1ab (6340) 
Human corona 229E pol 1ab (6031) ^^D^GS^E^ii DQVN « - - - r V^3: : %^fi^^;5 : ^Q' 
Murine hepatitis pol 1ab (6424) : £ ^'t^ t %M^M^gK<SvpY R * s A ^2 ! <P * ^ 

Consensus (6553) Y Y SDT P CVYMD GM DA K QVDY VPLRS ATC ITKCNIGGAVC 

Section 170

(6592) 6592 6600 6610 6620 6630

avian infectious bronchitis poi 1ab (5947) K K iKKQM^A r M^T ^^X^KvW^WW y T K X L a P YXJ* r 4 

bovine coronavirus pol 1ab (6379) 'ft^AfcB'^R^Yg^^ r jT A^'ic Y K F p 7 YY I^Np 
Human corona 229E pol 1ab (6066) SY< \>, ; L v 'RA>Y\e ; * F&Q:A€PN a v^VP'l'T O'DC r \ 1 VQ,T i 
Murine hepatitis pol 1 ab (6463) bkh At;^ &g y1|e£ ^ : t a tt ,\ • - r r 7 : • « y k ? f b f * y - 7 n f 7 
Consensus (6592) LKHABE Y RE YLES Y N T A T T AGE" T F H VYKTFDFYNLWH T F 
Section 171

(6631) 6631 6640 6650 6669 

avian infectious bronchitis pol 1 ab (5986) §Aii- A c f> M fTVlTGDKV^ 

bovine coronavirus pol tab (6418) : ' D>i 

Human corona 229E pol 1ab (6105) grgj^N l£ - kkS^^I^^^Ss^VG A D L ' " A * 5 G D. : . i? v 
Murine hepatitis pol 1ab (6502) T&L — s L K « V V Y ; l.VMS H^PCR* ' L.* C AV I GE K- VI 7 
Consensus (6631) TKL QSLEN IV YNLVNAGH FDG AG EL PC A I IGDKVFV 

Section 172

(6670) 6670 6680 6690 6708

avian infectious bronchitis pol lab (6023) XDQ VEK-*! v ;q" tl 7* r v ^ F ,f i^tl^i.nS 
bovine coronavirus pol 1 ab (6455) K X ' - b u v y 1 I - 7 * T Y " r n v 7 y i " f 7 : X H H > 7 L*. L* 7 
Human corona 229E poi 1ab (6144) RDGNT'DNLV i V £Kf Js®^iwM<^. f- . vg i t l- L * ±>L 
Murine hepatitis pol 1ab (6539) KIQNEDVW K N \ I :mv viy f - i B py ; if| (gfjffiF 
Consensus (6670) KIQN E DVVV FVM H T T L P T M V A V E L F AKRS I R H P B L K I L 
Section 173 

(6709) 6709 6720 6730 6747

avian infectious bronchitis poMab (6062) K\ . VDVTI.G T . * S .T P L?Y RN V* A 4 ' - -D:EP 
bovine coronavirus pol 1 ab (6494) jg&j g I 7 " c y 7 H ' T • ^ a rj : s l C - : : Y g m ' D L k§ i]fS 
Human corona 229E pol 1ab (6183) If&SC^VAT^ki^^^^ G 

Murine hepatitis pol 1 ab (6578) ghjfi^DV c ^ : H v L : { n v I c: :^ : : Y K k : l Dp g c^ET 
Consensus (6709) K N LN IDVT^K H V I W D Y A K E S P L C SNTYKVCAYTDLDFIE 
Section 174

(6748) 6748 6760 6770 6786

avian infectious bronchitis pol 1ab(6099) NGLV^LYD^. Y : 'd'y 3.' LAAD . A LVSTy^'Y PYSy|ei 
bovine coronavirus pol lab(6533) K-hNVL 7 dgrdn • .al^A 7KRS-N .g 'yisttkv slshi kg 

Human corona 229Epol1ab (6220) DV C ' CYD'.S IQ' 5YSR* ~LST A' L 3 T A V ' T G K S — 
Murine hepatitis pol 1ab (6617) s L N y LFDCRDN ALEA-K.fe R-GV^P'TTKI. S L- S M I KG 
Consensus (6748) LN V L F D G R D N 6 A Y £ A F K K S NAVY I STTKVKS LSM I KG 






43/199 

FIGURE 4A (contd.) 



Section 175

(6787) 6787 6800 6810 6825

avian infectious bronchitis pol 1ab (6138) ^S^LL^^g|Apl^dS;aS - - - — - ^LYpKfeVN- 

bovine coronavirus pol 1ab (6572) §^^^X^00^§p^C V^|A'y^T^G^0^^§§ Fj&Ls* 
Human corona 229E pol 1ab (6257) L^A I J?^Vg^0.^GnSi AT&KS E D^Ni^Nf jIwf^V^K^G 
Murine hepatitis pol 1ab (6656) |PQf^ 

Consensus (6787) P VradLNG VM VDKVGDS DV FWFAVRK DGN DV IF S R DS 
Section 176

(6826) 6826 6840 6850 6864

avian infectious bronchitis pol 1ab (6163) - — - — £;A FVTLPN T^&lfi H S Y 

bovine coronavirus pol 1ab (6611) -gR^f.S NQ SiE$S$L j$Sp<B P^NgG^^^T sfe^^il^S 

Human corona 229E pol 1ab (6296) K pyBjf^Dc/- - - - - - ----- - - - - - 1~ _ s _ f g:v i^v$£ 

Murine hepatitis pol 1ab (6695) & E pS'ii'^R^^^jS^ P £ GjkjR ViG^D Bs 'GViEJyh'Ji Pv G 1\ i^^K^'^J^^; 

Consensus (6826) L VSH Y SPQ.GN G N G L GNDALA T I FTQS RLL 
Section 177 

(6865) 6865 6870 6880 6890 6903

avian infectious bronchitis pol 1ab (6180) EpiEi^^^ <<'L>& ^iU^E' 

bovine coronavirus pol 1ab (6650) S3 :c i'FDM :k "lALD DLFIQK* Gi F. • Y A F E IV **\ 
Human corona 229E pol 1ab (6312) Q DfgL^^^T 

Murine hepatitis pol 1ab(6734) r?^S^^^^ftiS^ D i^^Jf^S A S:;# ^ ^ ^ ; P Y A r l& V V Y & 
Consensus (6865) S S F TP rVdMBK D F L ALDD DV F I Q KY GLE DY A FEH IV YGD 
Section 178

(6904) 6904 6910 6920 6930 6942

avian infectious bronchitis pol 1ab (6218) ppKPQL^ Ty 1 i- tiyfh l* U R A K V : ;na v s DC * VM\ h 
bovine coronavirus pol 1ab (6689) VtlQ'K i iV ' 11 Btl^ii^^g^QT s K .VlvE: VSYDS3Ih1x« 
Human corona 229E pol 1 ab (6351 ) Vi :< T L ; > L I rrQVfksi M ; J K A - l 1' \4 AS? IT L \ C 
Murine hepatitis pol 1ab (6773) Fg^K T I '. : C;i - H ; » T* I G, I? A :l R Q Q K s!h \ VI^Hjyff/Y DS'S'I KSjY; 

Consensus (6904) VNQKI IGGLHLLIGLYRRQQ SNLVIQE F VS YDS SIRS Y 

Section 179

(6943) 6943 6950 6960 6970 6981 

avian infectious bronchitis pol 1ab (6257) pFjgLb DriG- 3 Y « ' VV ~ L L L L R 1 LKEYGTN 
bovine coronavirus pol 1ab (6728) i L I 1)E KS-G -'-Sr 3 r i V 1 1 ? : . VALV K S*Jn$N£ V j - 
Human corona 229E pol 1ab(6390) C TjgTY LtfD P V- T . Y M • ^ ( v vlkll: LTvyf- 
Murine hepatitis pol 1ab (6812) D^N S G- I'V^V 5 ''*! VI " L • , / . . V I V K s i,Nji;K5y.S - 
Consensus (6943) FIVDE SG SSK SVC T V I D L L 1* D DFVE LLKSLNL CVS 
Section 180

(6982) 6982 6990 7000 7010 7020

avian infectious bronchitis poMab (6295) KS^\ v. > I Y, iHF' :t>F:CG IK"C V;i"Qg--A^ 
bovine coronavirus poMab (6765) --#<>:vn NV f.kdfo : 6rUi - 2 E R AASD r 
Human corona 229E pol 1ab(6428) --K^hc . 1 1 ^KPW^r L^e- i: ava'T vfs|-||i|'|f 
Murine hepatitis pol 1 ab (6849) - - A v v K \ N V F K p F Q F v. L : ; c n 5 F, r V* f i : R . A A A D K 

Consensus (6982) kvvnvnidfkdfqfmlwchde'kvmtfyprlqaaadwk 






44/199 

FIGURE 4A (contd.) 



Section 181

(7021) 7021 7030 7040 7059

avian infectious bronchitis pol 1ab (6332) C <£^#b>" B#^v5^C VHi$ P CN^ V G^p^feSjS^ii^^ 
bovine coronavirus pol 1ab (6802) [py^ ]:i\V VL^XYLN S PMiiR^XW^ K PV T l/P^^C^i^yA: 
Human corona 229E pol 1ab (6464) C&£;SS$gE£& 

Murine hepatitis pol 1ab (6886) g&Jwfc'E ^ 'V^M^^^G f C 3^H^£ 
Consensus (7021) P G Y S MP V L Y K YQN S P L E RVN L W W Y G K P I T L P S G I MM N V A
Section 182

(7060) 7060 7070 7080 7098

avian infectious bronchitis pol 1ab (6371) * v^'c <I.S >s i; -I C v H r >,\ MM « ^ f x4 i DK xYr, 3 j 
bovine coronavirus pol 1ab(6841) Y'Y. feiS T. :T\ ~i KA w£ V r<MHVL»h' T. Y r* EK'V.^.eCi^ 
Human corona 229E pol 1ab (6503) Kx-^i\-) _yS1 1 I.'C>J CH^kli YLH hi i\< mY&X^^Z§m 
Murine hepatitis pol 1ab (6925) Y,,> Y -LS'T. - l L R'A^xv'L;? L. . *</ M)k^ A P r fAV 
Consensus (7060) K Y T QLCQ Y L ST T TLC VPHNMRVLH LG AGSDKGVAPGSAV 
Section 183

(7099) 7099 7110 7120 7137

avian infectious bronchitis pol 1ab (6410) i^KQ^Ei l .L- "' IVDY ' AH' svr DCNirYNTSHK 
bovine coronavirus pol 1ab (6880) S^QMIyy^ 
Human corona 229E pol 1ab (6542) ^K'.-r H I V V'ldy^v . ADFSVT^A^y^DK 
Murine hepatitis pol 1ab (6964) x j*Ql' > ' A G 5J £ II S^N^jgjVi.S ^'S^v\ A S-Y<Y G y ^ 3 f r^P rTjp§. 

Consensus (7099) LK Q W L PAGTI LVDND V V PFVSDAVASYFGDCI TL P F D C Q 
Section 184

(7138) 7138 7150 7160 7176 

avian infectious bronchitis pol 1 ab (6449) V \ ' V : 1 V D N D K R K ' ; B = \ X A N M G M O " V : I Y^ssj^p 

bovine coronavirus pol 1ab (6919) w^ ; - 1 "Pi rkKfYY <yn j dgf t i'CH \ l*H 

Human corona 229E pol 1 ab (6581 ) 'kv ■> L PGR V K A i*GE N ~K EG Y T INGFI 'C 

Murine hepatitis pol 1ab{7003) to \Y>;i " DFLTKKIGKYM / f> K - - ~ - DGf Y Tj £-H Y\TR 
Consensus (7138) FDL 1 1 S DMY D pFtKN IGEYNVSK DGFFT YXCHFX P. 
Section 185

(7177) 7177 7190 7200 7215 

avian infectious bronchitis poMab (6488) i r> ' - * T V V""T^I EV • D f AO 'CAW H.u'AV 
bovine coronavirus pol 1ab (6954) r. K . L " * * V n 1 Y 1/ ' ri-^UAE . ' LMGYYAL V-;;!^ 
Human corona 229E pol 1ab (6616) E*K 1 IF V Y Y , N K Y ~ E LV.0 R i S r - . K . 2 vr Vf 
Murine hepatitis pol 1ab(7038) '§K j. 7 r * ' v ^ r\i" M r r:NAS .■ * LKG K - AFv;: pt .YNV 
Consensus (7177) D K L A h G G S V A i K I T E ITS W N A EL YDLMQK FAFWTH FCTNV
Section 186

(7216) 7216 7230 7240 7254

avian infectious bronchitis pol 1ab (6527) ^ A;: r :: A LI ' V i ■ Y F G - A . E K v l- y „ ^'j'lK. : pi Vti< 
bovine coronavirus pol 1ab (6993) WH\:r - g j, r ' I u Y-1G - - K p k v E 1 V N IY l s 

Human corona 229E pol 1ab (6655) r * * A "VV> - Z Y DG'DFAQG PF 1 U Nil* '\V « i L> 
Murine hepatitis pol 1ab (7077) \ A ' y G YI T uto;£:J - - KT R £YD ^ TM.i .u i Li ! Y Y-Y- S 
Consensus (7216) WAS SSEAPLIGlil YLG K KVE X DGNTMHAN YLFW RNS 






45/199 

FIGURE 4A (contd.) 



Section 187

(7255) 7255 7260 7270 7280 7293

avian infectious bronchitis pol 1ab (6565) iv h ££&V a K ? D L RL kfe P yv N LrK-T K T -Dlfey F N 
bovine coronavirus pol 1ab (7030) ^V\WNGt>AY^ A : IV J lRADQI N*Dlg4£ Y§' 

Human corona 229E pol 1ab (6694) T,VM: L'sjYNsV.v TjL sV,£-NC K : H^fe 7 v V$l ■ K; D -> T - fl; N'&ffV L*S ! 

Murine hepatitis pol 1 ab (71 14) n q g a y s 3 , b' i 'Ms - F : p;k% ; a A r G& a v; y :*; i, k p n q>n p %' l s 

Consensus (7255) TVW N G S AY S L F D M A K F P L K L K AT A V V N L K DQINDLVLS 

Section 188

(7294) 7294 7300 7319
avian infectious bronchitis pol 1ab (6604) j/jjK CG k*JL L vi:H;D-ViG^ T sK^'isp^b'^'c t m SEQ ID NO: 9905
bovine coronavirus pol 1ab (7069) LkKOK! ".V- DT *: K EA'V V'- DS L * iWa SEQ ID NO: 9886
Human corona 229E pol 1ab (6733) U\f * - - I KCI?S FpK HC^lg SEQ ID NO: 9914
Murine hepatitis pol 1ab (7153) f>lRK ki. ,v dtrkevf^dsl 1 nvk SEQ ID NO: 9887
Consensus (7294) LIE KG KLL V R DTG K E V FVSDSL V N V K 






46/199 

FIGURE 4B 



human coronavirus OC43 NP 
Bovine corona NP 
avian infectious bronchitis virus NP 
mouse hepatitis virus NP 
Consensus 



(1)
(1)
(1)
(1)
(1)
(1)




M S Frv P G G Z M 'Kfe G H SjS^Vgj^G^^; I LK KIT \vA G qSIb G Pfl. 

rIssgSrsghgilk 



MSFTPGKQSSS 



WADQSDQARN 
Section 2 

78 



human coronavirus OC43 NP 
Bovine corona NP 
avian infectious bronchitis virus NP
mouse hepatitis virus NP 
Consensus 




VQTRGRRAQPKQTATSQ 



human coronavirus OC43 NP 
Bovine corona NP 
avian infectious bronchitis virus NP 
mouse hepatitis virus NP 
Consensus 



79 90

G r K^F:EF : -v -P* X APGV PATLAIK 
C E< S F G FA , 0 V 7. A P G V PA T rJ A K 



(79) 
(75) 
(75) 
(45) 

(78) G K KF P A . 0 V ' L A N G I ? A S V Q K 
(79) 



PSGGN VV P YYSW FSG ITQFQK 

Section 3 

100 117 



hi- Ad 



V"-HN g's; 
V H EC F3 • (TAD 

rg j a'- ~ ~ g % : 



(118) 

human coronavirus OC43 NP (1 14) 
Bovine corona NP (114) 
avian infectious bronchitis virus NP (82) 
mouse hepatitis virus NP (117) 
Consensus (118) 



(157) 

human coronavirus OC43 NP (153) 
Bovine corona NP (153) 
avian infectious bronchitis virus NP (121) 
mouse hepatitis virus NP (156) 
Consensus (157) 



GKEFEFAEGQGVPI A PG VPAS BQKG YWY RHN RRS FKT AD 

Section 4

118 130 140 156

KG Eg LI, PP.: • 1 K KT.QY T i 'J- VY ,G^3Q 

N O R C L L P P. L - H K L* ^ / T 1 '_> 7 F ~ ' S N G 

- 'KPVPDA. T - A LUvJ P3QG IV . A >G 

CO kg: r pp. l :i ga y dsi :-: vf g:^@ 

GNQKQL LP RwVf Y YLGTG P H AK DQ YGT S I DG VFWVASNQ 
Section 5

195 



157 



170 



vir: pad f v'. 

VP 7 ? A P I I j 
V SRS'GGT 



; A !■■ 



180 

i; P : P;: - ~ 
'^BG~~ 

FDQY - hi D -GPDGNFRWUBflP 
:3'S-.EAI, T A P P V L ?Q G F Y V 



ADVNTRA DIVDRDPSSDEAI PTRFPPG 



(196) 

human coronavirus OC43 NP (189) 
Bovine corona NP (189) 
avian infectious bronchitis virus NP (180) 
mouse hepatitis virus NP (192) 
Consensus (198) 



196 

i 



,210 



220 



TVLPQGFYI 
Section 6
234 



GS APMr.^n? F S A S S A G £ P r : F A G 5- Ti TPT - G 

G S J A P N 5 •< 3 7 P A S A S S A G f R f> RAG S G £G- T P T G 

L ' P GR " T A A S - A A 3 . : * .i G GRR " G 

EGS A P A3 P 3 G PS * G. )H~ PAPS G " *Q "Q BG T 
E G S G R S A i? H SR STSRASSRASSAGS RSRAN S GH RT FT S G 






47/199 

FIGURE 4B (contd.) 



Section 7

(235) 235 240 250 260 273





i hepatitis virus NP (229) mi 

Consensus (235) VT PDMADQIAS LVLAKLGKDATKPQQVTKQTAKEVRQKI 
Section 8

(274) 274 280 290 300 312

human coronavirus OC43 NP (267) 
Bovine corona NP I 

avian infectious bronchitis virus NP (227) r. u *y t- i K^i-yi 

mouse hepatitis virus NP (268) XpNX^RRQ K -.gPH%Q C k Q'l'C * - K - c- F N 0 - - % T r 4 G °-ftHXL 
Consensus (274) LNKPRQKRS PNKQCTvIqQCFGKRGPNQ NFGGGEMLKL 
Section 9

(313) 313 320 330 340 351

human coronavirus OC43 NP 304) v , s i i i^EXA^f 

Bovine corona NP (304) q^si p:j ltA hi b ? A-i t F£ . - k ce LA'XVQr - 

avian infectious bronchitis virus NP (262) GlKDGnv i AML* fTjl^fPH^C \ J - l /> T PKLQPDGLHLRF 

mouse hepatitis virus NP (305) c^S.r/$Q;F> T L A E ; A r T.'G'.K * K ^E : r?VK:K|fj 

Consensus (313) GTS DP Q F P 1*1* A E L A P TAG A F F F G S RL S LAKVQN 

Section 10
390 



I'VSgRPK 



(352) 352 


.360 370 


_ J80 


human coronavirus OC43 NP (337) - - -*^s55# 


r ^ CK V r E 1. RYN G/- 




Bovine corona NP (337) feS &k; 




> in ? L§tG;i 


avian infectious bronchitis virus NP (301) EFTTVVP 




;||dgv- ,-i-RPKI 


mouse hepatitis virus NP (337) 






Consensus (352) I. S G W 


DBPQKDVYELRYNG/ 


\ jrfdstlsg") 


(391) 391 _ 


400 




human coronavirus OC43 NP (373) LN EN L 13 A 
Bovine corona NP (373) f&£&& L H A 




Q R G'H KNJ3 'Q - • 


avian infectious bronchitis virus NP (340) SR l 7 Ws - - 


Q Q 9 U G M M-H M S 7,K 1 




mouse hepatitis virus NP (372) 


|liu>G ; G'ADv l v;s;i kp; 




Consensus (391) LNENLNA 


YQQQDGMMNMSPKPQRQKG KNGQ 



Section 11

429 

ON 



GBNDN 
Section 12 



(430) 430 440 450 468

human coronavirus OC43 NP (409) jes 1 ■ v P K ? v Q Q ; , >' 5R E ■ TAC, rH.'.LKK MO E P Y T 

Bovine corona NP (409) |fs^f A P^stRVUQ- K SH B^'PAK . :;iLKKHD|f §g£ 

avian infectious bronchitis virus NP (373) |fp K§g|T S Di:. E < l4 : • -'- A Q ; EFD ; ;'.E P k||l MW G©]3 -A 



mouse hepatitis virus NP (410) --KPKSSVC V3 PE . RSLJiAvlSLj&DGVVPDGL

Consensus (430) ISVALPKSRVQqSkSREi/tAEDI SLLKKMDDP YT 

Section 13 

(469) 469 474 

human coronavirus OC43 NP (443) EDTSF I SEQ ID NO: 9915
Bovine corona NP (443) EDTS EI SEQ ID NO: 9887
avian infectious bronchitis virus NP (404) LG E EV SEQ ID NO: 9906

mouse hepatitis virus NP (449) ^gD^jNp 
Consensus (469) E DT SEX 






48/199 

FIGURE 4C 



human coronavirus OC43 HE 
bovine coronavirus HE
mouse hepatitis virus HE 
Consensus 



(1)
(1)
(1)
(1)
(1)



1 



10 



.20 



30 



Section 1 
42 




human coronavirus OC43 HE 
bovine coronavirus HE 
mouse hepatitis virus HE 
Consensus 



(43) 
(39) 
(39) 
(43) 
(43) 



43 



MFLLPRFI.LV SCI IGSLGFFNPPTNVVSHLNGDWFLFG 
Section 2

50 60 70 84 



human coronavirus OC43 HE 
bovine coronavirus HE 
mouse hepatitis virus HE 
Consensus 



(85) 
(81) 
(81) 
(85) 
(85) 



dsrsdcnhivninp nys ymdHwp lcdsgkisskagnsiVr 

Section 3

110 126

XJ^^V t ; rj A' \ * R 3 C S t\ D 1 1'" 



85 



Pi: 

S FH FTDFYN YTGE GQQI I FVEGVNFTPYHAFKC 



(127) 

human coronavirus OC43 HE (123) 
bovine coronavirus HE (123) 
mouse hepatitis virus HE (127) 
Consensus (127) 



127 



140



150



SGSNDIW 
Section 4
168 




(169) 

human coronavirus OC43 HE (165) 
bovine coronavirus HE (165) 
mouse hepatitis virus HE (169) 
Consensus (169) 



(211) 

human coronavirus OC43 HE (203) 
bovine coronavirus HE (203) 
mouse hepatitis virus HE (211) 
Consensus (211) 



MQN KG.L FYTQV YK NM AV YRS LT FVMV P YVYNG S AQS TALC KS 

Section 5

169 180 190 200 210

Qfy _V. . > tAY.aPCAiiSG^ i Y K J 7 . *S * Y S ><* Y ' Y 

G.l-x r 'v . ;'^AI--C ' - 1 K 1 7 O Y.,S ".C -I \ 

iIhgvt:/: itifh'i 1 v r; §|&vsk p r: v." y {;s-k>& TLo^fi>§g^ 

GS LV L N H P A YI A K E AN GO Y Y Y K V E A D V Y L S G C DEY I V
Section 6

211 220 230 240 252 




(253) 

human coronavirus OC43 HE (245) 
bovine coronavirus HE (245) 
mouse hepatitis virus HE (253) 
Consensus (253) 



PLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGLHSTETITTG 
Section 7

253 260 270 280 294 




FDLNCH YLVL PSGN YLA I SNELLLT V PT KAICLMKRKDFT PV 






49/199 

FIGURE 4C (contd.) 



Section 8

(295) 295 300 310 320 336

human coronavirus OC43 HE (287) ^S^Vfe^ 

bovine coronavirus HE (287) u'vV0!?C<WiKNA^GS D *. : a v At g p r : " v >' S * / s . tkVv«,v ^ oifi' 
mouse hepatitis virus HE (295) pSi-is&MH " N p : gg$1^ i\\'l)v;c 0 L f f;d ^^^jferSlP^ S H 

Consensus (295) QV VDS RWNN ARQ3 D WMTA V ACQ P P YC YFRMSTT N YVGWDIN 
Section 9

(337) 337 350 360 378

human coronavirus OC43 HE (329) ^e^At, S r f £ 'Z j L^M5P,^' A ^Vi ft : D,^V SSV^r L -.EY^R 
bovine coronavirus HE (329) H-:-L>A(vi : 'CJM i..3:- 1. {)$£C?^Q&£y*&i^i\^ ? .VV$ Klk ^ y^R 
mouse hepatitis virus HE (337) H i'&fi -.G^u/V ^MXJIViC^OQC^S- V^NK\/W Sfi' ^QY^^K 
Consensus (337) HGDAGFTSIL SGLL Y N S P C FSQ QGV FRY DM VS S VW PLY PYGR

Section 10

(379) 379 390 400 410 420

human coronavirus OC43 HE (371) r.i r ,> »o jnn'p DL * T "Vi ,u * ^ l . v;vi v: LL : 

bovine coronavirus HE (371) L a I i 3 V;y 

mouse hepatitis virus HE (379) ah! v f-mJv|S/ n y * ~ ' -^I^S^^^ipiF^^l 

Consensus (379) CPTAADIN PDLP ICVYDPLPVI LLGILLGVAVI 1 1 VVLLLY 

Section 11

(421) 421 432 

human coronavirus OC43 HE (413) ;\ Dj SEQ ID NO: 9916 

bovine coronavirus HE (413) .-v M « T < n *„ SEQ ID NO: 9888

mouse hepatitis virus HE (420) !<A A./s G AA;A EA SEQ ID NO: 9899
Consensus (421) FHVDNGTRLHPA 



FIGURE 4D 



bovine coronavirus Sm 
avian infectious bronchitis virus Sm 
mouse hepatitis virus Sm 
Consensus 



(1) 1 .10 20 

(1) MFMA D AY> A P; ? V W YV G Q.l;X : i<: : r V A 
( 1 ) |in||l NKSL'Efc G S^TA|Y I 1 V G 

(1) M F N LFi nM)7VVrvvGQT IF FA 

(1) M M N PL DTVWYVGQI IFIVA 



50 



60 



bovine coronavirus Sm 
avian infectious bronchitis virus Sm 
mouse hepatitis virus Sm 
Consensus 



(40) 40 

(38) g) F K I d X p I . te ;^P:feg^f<Sl#S^ 



Section 1 

39 

ST L A&; Y 6$Jg R / <.LQ - - : V Q 
VCuMV^^f V * I. A 
ICLLVII IVVAFLA 

— . . Section 2 

78 



• : V YifjYT YG RSLNffpp^ 

(40) S KLCIQLCGLCMTLVLSPSIYLF R KQ YKFYN ELK 
Section 3 



bovine coronavirus Sm 
avian infectious bronchitis virus Sm 
mouse hepatitis virus Sm 
Consensus 



(79) 79 

(76) p'S^iW^M^Ph 



90 



(79) AVjffiglNEFPKMGWNNKNPANFQDA 
(75) Lfg^feipl 
(79) PXLDVDDI 



108 

SEQ ID NO: 9889 

QRDK.LYS SEQ ID NO: 9907 
SEQ ID NO: 9900 






50/199 

FIGURE 4E 





(1) 1


,10, 


20 


30 _ 


human coronavirus OC43 M 


(1) -m§Mi 








bovine coronavirus M 


(1) -*£|s.\ 








avian infectious bronchitis virus M 


(1)








mouse hepatitis virus M 


(1) MT Sj|. 








Consensus 


(1) MSS 


TTPAPVYTWTADEAI KFLKEWN FS LGI1 I 



Section 1 
40 



Section 2 
80 



human coronavirus OC43 M 
bovine coronavirus M 
avian infectious bronchitis virus M 
mouse hepatitis virus M 
Consensus 




human coronavirus OC43 M 
bovine coronavirus M 
avian infectious bronchitis virus M 
mouse hepatitis virus M 
Consensus 



{41)41 .50 €0 70 

(40) f-- • 

(40) 
(36) 
(41) 

(41) LQFGYTSRSMFVYVIKMI I LWLMWPLT I I LT I FNCVYALN 
_ — Section 3 

(81) 81 90 ; 100 .110 120 

(80) 

(80) 1 
(76) 
(81) 

(81) W V Y L G F S I V FT I V A 1 1 MW I V Y F VN SIR L F 1 RTGStfWS FN P 
— Section 4 

160 




human coronavirus OC43 M 
bovine coronavirus M
avian infectious bronchitis virus M 
mouse hepatitis virus M 
Consensus 



human coronavirus OC43 M 
bovine coronavirus M 
avian infectious bronchitis virus M 
mouse hepatitis virus M 
Consensus 



human coronavirus OC43 M 
bovine coronavirus M 
avian infectious bronchitis virus M 
mouse hepatitis virus M 
Consensus 



(121) 121



_ .,130. 



,140 



.150 




121) .T u^C.».M^Vri'VH?I. OYHTU -> " - i ?G E: T« VMQG V 
121) ETK3MLMC I DMKGTMYVRPI I EDYHTLTVTII RGHLYMQGI 

_. „. _ - - Section 5 

200 



161) 161 „ 
160) K 



170 



180 



190 




56) QWLrii^L n f UH'ks:&i\vm£ m\s f v u $\ kin jl :*.*«- 

161) h,^^^d: -ayvt, AKV.hi:Kf#«o:;/s|^ 

161) KLGTGYSLS DL PA YVT VAKVS HLCT YKRG FLDK I DTSGF 
_ _ — Section 6 



201) 201 




201) AV VHSKV ; Y, 1 B|,^ G , nTAljIl! p I - 
201) AVYVKSKVGNYRLPSTQKGSGMDTALLRNN1 



SEQ ID NO: 9917 

SEQ ID NO: 9890 

SEQ ID NO: 9908 

SEQ ID NO: 9901 






51/199 

FIGURE 4F 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



Section 1

(1) 1 10 20 30 40

( 1 ) IH &ii o p^i> \0mi 
(1) . : : . . - - - - - - - - : i - — . — 

'■■ ' v .;*\\;,k*j 

mv NSNGAHVS A : P;$ar-$;?$ 



(1) h^^i^^ 
(1) ftke$F6$5F^ 

(1) M F L I ' L L I S L P T A F A V I G D L K C T S L



IND DTG 



(41) 41 



50 



60 



70 



PSISTD 
Section 2
80 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(40) T.V n V T M G L GTY V y L H R VYIHT^I, !<' > N V i P 'J 1 S G 3 T^:R$Jj$ 

(1) - - — - ------ - 1 -Ifiv ipj.|i£ vt l lc|3lc||av L Y D 

(40) T^tiyt ngl g t v v / :.z p v y f \m t *< l l ■ 1 1 g y r p t s g p \ \ r w i 1 a 

(41) TVEVSQGLGT v YV; jPVYI.Na ^LLi I C Y / PV ^S'K^fgtj^B? 
(41) T V D V T N GLG T Y yVlDR V YL N T T £ L L N G Y Y P TSGS TYR N M A

Section 3

90 100 110 120



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 




LKGTVLLS 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



SWFKPPFLS DFMNGI FAKVKNTKVI KDAVMY 
Section 4

(121) 121 130 140 150 160

(120) SE F I A 1 Y . G S T F V >: T S Y £ -V V V v P K 'I I H S T Q D G Y M K-L Q G L'L 
(63) SGC : 7 G I 'feHGG R'/V ASl. IMtTAP--- 

(120) H F P A I ;1' I C 3 T F V Y T X V S.V VVO — - HTTIL G F K 1 , 6 G I > 
J\ V F P : 7/ ' i G 3 L F G Y T 3YTVV t b P YN|- - - - 



(121) Ky;*v 
(121) s e f p a i t ig st fvn t sysvvvqp 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(161) 161 
(160) 



170 



180 



.190 



GNKLQGLL 
Section 5 
200 




(149) M VCQYTiCLLPYTD KPKT G,K K L.I G ' 
(161) E I S V C Q Y T MC E Y PMTICNP^LG N RXELWH 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(201) 201 



210 



220 



230 



DTGVVSCL 
Section 6 

240 



(199) -Y K RYH Y D V N A 0 Y;f ^FttF^E G G T F Y A Y F T D T G;V V T X F Y F 
(126) LSefii.IRVSAMKNGO "PLTV S V AKY PT Fr|fQCVH N ; T 
(195) YKR; l- T Y D V KA C Y L Y F K F Y Q b G « T F Y A Y FT DTG VV T X F F 
(189) LKR FT P Y VN A Y A Y F I i F Y Q i ' G r > T }• Y -\ Y Y LPPS,T Fr 
(201) YKRNFT YDVNA OYL YFH FYQE GGT FYAYFTDTGVVTKFLF 






52/199 

FIGURE 4F (contd.) 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(241) 241 _ J250 , .260 

(239) h'k:V^CM;\.^S^XY^ri^O^ 

(166) '* ^ 

(235) 
(229) 

(241) SVYLG ILSHYYVMPLTCN A 



Section 7 

270 280




300 



LTLSYWVTPLTSRQY 

Section 8

310 320 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



I D CMV R G iMhaQYmHG NFS: D^F^P 



(281) 281 290 
(275) ul A FU 0 Dt VI F F,M A E J . M S OF W S U I KTJ Q Si* PT T . V > EL 
(206) ALAY<FVNqTAQDvgfL 
(271) T/LAFNOD^VIFiCAVD - 

(269) § £ n ; hko k <|^tV;av!d;. : 
(281) l l a it n q dg v i f wav d c



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(321) 321 



330 



K'S <pM;5E3;;K KT-LS I A ? S:T G:V ; -" FT, 

a : ^;^t^e;i k"k:i;os m l«? s t' v V ^. E-L 

S S F M S EI KCK T Q S J A PS T GVYE L 

Section 9 

350 360 



340 




human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(309) * C Y T V ;> ?.VG V V Y R V A \< LP A C N L ; K h L I A R S VPS, P L N W S R 
(321) NGYTVQPIADVY^RIPNLP CN I EAWLNDKSV PS PLNWSR 

Section 10

(361) 361 370 380 390 400

(355) K * FlT'C ;?:f:-!SCLXSr 1'. A D E FT.C UK J u A A K X .'C-MC SS'I 
(277) G A>, P v P - - S G VOnToTYQ V " T A Q S G Y \ N : ' N • S F U 

(351) k fs ,CFw:»sc5LMcri . a y s i"* <; l: ^ i aaki \gmc 
(349) v . f . j::.! : ;H: J ssLLRYy rf AEs' < K fr;?;iLASKV -GRC 
(361) KT FS NCN FN MS SLHS fTqADSFTC NM I DAAKXYGMC FS S I 
Section 11



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



human coronavirus OC43 S 
avian infectious bronchitis virus S
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(401) 401 410 420 430 440

(395) TIDKFAi P Is G R K V D I Q LG N h G Y Q S 7 I Y R I DT J' A T ' C Q L Y 
(311) S5 FV^KE£N;FMYG( Y iP^CKFRI T . MGL'WF :3L>V I'A 
(391 ) T I n K F A I P N G R K V D ' Q G N L G.Y . Q 5 r Y I? 1 0 'I' 'I' A T :: C Q T, Y 
(389) SVDK FAVPK R-„ V D AO AG M-S G F" >Q T A YK7 DT; AT CQLH 
(401) S I D K FA IP N GRKV D L Q L GN L G Y L QS F W Y R I DTTATSCQLY
Section 12

(441) 441 450 460 470 480

(435) l^^^^^l^^^^f^i^^iil I fe Dp|g?^ ^I^^li, tM 

(351) v p - - - - LQ? 

(431) U L ? AA x Ky£sffS PFNPST W Nl^F^FTE Q S V F K p 0 PA G V FT.. 

(429) v r^KN^^V?" p;:sv:hr?,ygf::!ja vfgk§ § 

(441) YNLPAANVSVSRFN PSTWNRRFGF £ SVFKPNPAGV TN 






53/199 

FIGURE 4F (contd.) 



Section 13

(481) 481 490 500 510 520

human coronavirus OC43 S (475) fiy&y^Y^ ^ >] c ii - K a; p k n f n o.p z'Kb n g - s cVg£gp gknn 

avianinfectiousbronchitisvirusS (357) -~~GCK!§S vjF KG r H;A TCCiYAYS YGG Pi§§jci;GV Y SG - - - 

bovine coronavirus S (471) H;p^\}^ iiCt K;ASfetf CK f;DGSL\ V^?IG?G I DAGYKT S 

mouse hepatitis virus S (463) H by V>Y A ;> ,/6t T v^|;U^C ?^AQ P O X§^PC§jjTQT K P 

Consensus (481) H DVV YAQHC FKARSN F C PCKL G I.SVGSGP K 

Section 14

(521) 521 530 540 550 560

human coronavirus OC43 S (509) G.IG^G F> A G/J^ Y LTCDW; ----- - : £S#fef?^ag- - F;|^T^|g-P 

avianinfectiousbronchitisvirusS (388) 

bovine coronavirus S (511) |^^^H^AAQC DC^(|^|^g^S KAcff P|^l 

mouse hepatitis virus S (497) 

Consensus (521) GIGTCPAGTNYLTC N LCTPDPXT TG YKCP 
Section 15

(561) 561 570 580 590 600

human coronavirus OC43 S (541) ^^f^S^^S^^S^S^^ g k s c t c R'P q a f l g 

avlaninfectiousbronchitisvirusS (388) - -- - EfifQ H &F E § G E % V YVT :< S G'g : - - - 

bovine coronavirus S (551) qtk \ rVVG I G-gfi ' i . iMM^B^ti Igfe vWp§Q:&M^W; 

mouse hepatitis virus S (497) - f^^f^m^t^HC i G "GV:L * :> ;c$ N ADPH ;< GDI :^ANg^g 
Consensus (561) QTKALVG IGEHC SGLAVKS DHCG GN CTC PQAFLG 

Section 16

(601) 601 610 620 630 640

human coronavirus OC43 S (577) v 3 A DtC LOG DKC Hi FAN FJ LK DVIk S G LT C ST I _ „ IKANTDI 

avianinfectiousbronchitisvirusS (407) T2 1 It 1 1 - -SH-|0TAT t: P : V i TQMj; Y »\ ?: i 

bovine coronavirus S (587) w;s v C k I fa n f i l h d v \ ;;ri;es-J D l ; K s ; ? c : 
mouse hepatitis virus S (536) W SHOT C £ V >i D R C: Q I r A NJLL' I K S G T f C 55't®L%L p||t§J§ 
Consensus (601) WS DSC LOG DRCS X FA N F IL H DINS G T T CSTDLQKAHTDI 
Section 17

(641) 641 650 660 670 680

human coronavirus OC43 S (617) ILC-7 - N r-DJ Vf I I,C T : F V E, / :5 A 7' 'r YM UWCfC T. I, Y-J-S M G K 
avianinfectiousbronchitisvirusS (427) ri. a TO : I '* P "i - . csav — Y u Y * A ^ A • L A 
bovine coronavirus S (627) f LGV - :c IL . :t.' tf.vf,* t^vj r Y w 3 WQH T, . rr.fWGN 
mouse hepatitis virus S (576) VTGi: . n V I T c 6 :-.v F kWka D f YNVS wll%i;6|;f>^^ 
Consensus (641) I L G V C V N Y D L Y G I T GQG I F V E V H A T Y Y N S W Q N L LYDSWG N
Section 18

(681) 681 690 700 710 720

human coronavirus OC43 S (657) L YG rRD i I IN'KT F Ml RSC YS'G'Ry.SAAFH AK S S F PALLFPN 
avianinfectiousbronchitisvirusS (465) g£b T S GS S D I F V v Q G K Y G L N Y Y K V N P C F D V N Q Q F v |f S G G K 
bovine coronavirus S (667) rxfeFR o Y L T N P T F M I R S c ^ r; G r v . - a a E' h a m s s k p a l L f i\ N 
mouse hepatitis virus S (616) L II G F P D I, T T K K 1 Y r r J R. S C V 3 G R V S A A F 11 K A CPALLYRN 
Consensus (681) L YGFRD Y I TNRT FM IRSC YSG R V S A A F H A N S S E P A L L F R N



PP20480.019 



54/199 

FIGURE 4F (contd.)

Section 19

(721) 721 730 740 760

human coronavirus OC43 S (697) §|jg¥Y^ I gy.<a
avianinfectiousbronchitisvirusS
bovine coronavirus S
mouse hepatitis virus S
Consensus (721) IKCWY VFNNSLSRQLQPINYFDSYLGCVVNADNSTSEAVQ

Section 20

(761) 761 770 780 790 800

human coronavirus OC43 S (737)
avianinfectiousbronchitisvirusS (545) 'v^VYvi
bovine coronavirus S (747) T;STil\\
mouse hepatitis virus S (696) N WSS^M^^h pl^M§^S PML
Consensus (761) TCDL TVG S G Y C VDYSK RRS RRS ITTGYRFTN FE *PFT VNS

Section 21

(801) 801 810 820 830 840

human coronavirus OC43 S (777)
avianinfectiousbronchitisvirusS (571)
bovine coronavirus S (787)
mouse hepatitis virus S (736)
Consensus (801) VNDSLEPVGG L Y E I Q X P5EFTIGNMEEFIQTS SPKVT I DC

Section 22

(841) 841 850 860 870 880

human coronavirus OC43 S (817) AAF,"" C7< f A^~KSQI VH *: S*K^ D: jn^T t E;'NEL 4 LI>tTS
avianinfectiousbronchitisvirusS (611) lq}y<v \ jld q % . v ,ir : i.svv " 1 : ke|mel
bovine coronavirus S (827) SAF* I s - Y A A 1KSQLVE ' 5F D" 7 I J A J I.TfJ N bigff^
mouse hepatitis virus S (776) a a F ' * ' r^r: a RyQTvn" ,sr.v ,vijail\h:> hmlukmc
Consensus (841) A A F V C G D Y A AC KSQ L V E 5 G S F C DN I Sa'i LTE VN S UD TTQ

Section 23

(881) 881 890 900 910 920

human coronavirus OC43 S (857) QVA SI '■A>lGV;i\] 3 T K li r 0 G VJJ F !. V f D I K . S?\ GC'I GS : "
avianinfectiousbronchitisvirusS (651) M F Y S S T K i ' G. TPVLSKV TGt T-IShLLTK 'S'
bovine coronavirus S (867)
mouse hepatitis virus S (816)
Consensus (881) LQVASSLMNGVTLSTKLKDGVNFHVDDINFSPVIiGCLGSS

Section 24

(921) 921 930 940 950 960

human coronavirus OC43 S (897) Ip&AlI : - - - F 3 : 1 i DK Ki.SLVGFVE ; I H * 1
avianinfectiousbronchitisvirusS (686) SR^KRili Lp^a^rgTSr^SVv
bovine coronavirus S (907) i^iov&f
mouse hepatitis virus S (856) C | E DG~NGPS A I RGR
Consensus (921) CAK SS RSAI EDLLFDKVK LS DVGFVE AYNNCT



IrPTKD , .Vt s. 1 




FIGURE 4F (contd.) 

Section 25

(961) 961 970 980 990 1000

human coronavirus OC43 S (930) f$0- -&|I^i|s|KS^ 
avianinfectiousbronchitisvirusS (717) A 0 PLG F E^&y A« A KE^fN^Si Spii^^^A E M Q^L^-S^L V % 
bovine coronavirus S (940) §(~ne- - ^^kiny^h^^rK v i;Lsenq r S G y t£a & .& 
mouse hepatitis virus S (896) §1lQB~-;t^ 

Consensus (961) GGAE IRDLICVQS YNGIKVLP P L L S B NQ I S G YT LAAT A 

Section 26

(1001) 1001 1010 1020 1030 1040

human coronavirus OC43 S (968) A S L x i;*p rf A A G v / v Y 1, k * >A i .< ' G L - v » m n y ; s 0 ^ - K L .1 
avianinfectiousbronchitisvirusS (757) |«A:?,GG ifeA^ilflA T Q -Jy^A K i j%H&Sf$t!Q st|£ L K&ipB kf 
bovine coronavirus S (978) AS P ^ A'-AY - YDN^ : Ya r^&II >V : :FsA.'/ SQi iK&lj: 
mouse hepatitis virus S (934) AA : M : ;. PP>IS /iAGV rsijSv Y ; s G L 1 V " M I V A S £ if 0 fsfSpP} 
Consensus (1001) AS £ F P P W SAAAGV ? F YLN VQ Y R I N GLGVTM DVLSQHQ KL I

Section 27

(1041) 1041 1050 1060 1070 1080

human coronavirus OC43 S (1008) ~ Ka\;AN l'ya A DA-j:is:\ f M.K > *AlA7N^N ArMrRtf Li 1 
avianinfectiousbronchitisvirusS (797) ^ A SAV4 Ai|gH^|illR^S Lj^QQjraDgtRjS Kf||AlfcT ET| 
bovine coronavirus S (1018) \ N M LCM" :i , DAT-NS-iu >.K'i AV^MAN AIvAjI.NN Tfh 
mouse hepatitis virus S (974) ■ At" \N 1 G M .p;r.DA"NS UG*K I . ^'NAKAE AT-NN^L 
Consensus (1041) ANAF NNALG A I Q E G F D AT N S A L V K I Q A V V N AN AE AL NNLL 
Section 28

(1081) 1081 1090 1100 1110 1120

human coronavirus OC43 S (1048) QQ *SNK 7 T **A'SL • lspld' i* I- 1 ■ ~N/^ T A 
avianinfectiousbronchitisvirusS (837) a y _ > p t; x v . 4 s 3 
bovine coronavirus S (1058) QQ L S N R A ~ ' LO u L 5RL DA L 'E ^ Q" "Y^ I-L? I \ : 1 : f J> ^ 
mouse hepatitis virus S (1014) NQ *• SNR' \ i "A^L^S-l LTRLFAiVE A A ? I \-A AHG P L T A 
Consensus (1081) QQ L SNR FGA TS AS LQEI L^SRL D AL E A N A Q I D R L I N G R L T A
Section 29

(1121) 1121 1130 1140 1150 1160

human coronavirus OC43 S (1088) = NA>VA 'j A is 3 D S T h v K h is A A Q .A-W 7* ~ *' /^** . SSMNAV 
avianinfectiousbronchitisvirusS (877) ' A ' g A^E Y I R v . Q > P B L s , T < ;aa A sipy r A 
bovine coronavirus S (1098) UAW oa l SUS 'I U v K FS AAQ^ME Vi ,^ . .-syviEJ.-*.
mouse hepatitis virus S (1054) . N A Y^7QLS DSTLIjK VS AAQ , T A/A 0 JA A vTT-^IN-'C
Consensus (1121) L N A Y VS Q QL ■ S DSTLVK F S A A Q A M E K V NECVKSQSSRI N F C
Section 30

(1161) 1161 1170 1180 1190 1200

human coronavirus OC43 S (1128) 7 A' ^ N : X I G I V L - ^ VCLY , H ; v c ^K0f > A> V s i t 7
avianinfectiousbronchitisvirusS (917) . AA^r vltYp^ ^ N -TV. A I Y DS £7 ' . " I vL - V
bovine coronavirus S (1138) . aAa , ; ia^LV Y ' L Y ^: 7 t^Vvtakvs'? a I 

mouse hepatitis virus S (1094) 0 U ANjiu L S LA/C;A' A - VAX Y. •; ; H :■■ 7- I SEA ? A AV S Au;Ai 
Consensus (1161) OH G N H I ISLVQNA P YG'L Y F I H F $ Y V P T S F V T A K V S P G L C I




FIGURE 4F (contd.) 

Section 31

(1201) 1201 1210 1220 1230 1240

human coronavirus OC43 S (1168) - - -f^XS/iYV V^y^T Wj\3 ; Y^T,GAG^X^I:p^i 

avianinfectiousbronchitisvirusS (957) k p AH^S QYAI V pS;N G RSl Y "f iiWi'M&R A&tf 

bovine coronavirus S (1178) AG 0 :i t>KS . t > v N \ i>) N r WMF r G S G Y; Y « KP ' CT

mouse hepatitis virus S (1134) S|g d rg ^ ^ ^ ^K^p^G-S^S^^^^^^rigr^ 

Consensus (1201) AGDRG r A PKS G Y F V N V N N T W M F T G S G Y Y Y P i" P I T
Section 32

(1241) 1241 1250 1260 1270 1280

human coronavirus OC43 S (1202) E K U v - vm S*TC A v /rK^YVWLNTCi^iHPD: K E > v 
avianinfectiousbronchitisvirusS (997) AC V^Tg^sf^A >; 7 ' VN KT^TfbV^DM I) D F D \"7 , f K K 
bovine coronavirus S (1212) jg$[^ K|E : , 

mouse hepatitis virus S (1168) :;Vl/s^Msl|c ^^l^fl^f S^rh N T ; S I P- $PBD ~ KCi ;.,DKW 
Consensus (1241) ENHVVVMSS C A V H Y T K A P D V M L N T S I P NLPDFKBELDQW 

Section 33

(1281) 1281 1290 1300 1310 1320

human coronavirus OC43 S (1241) F KN q ? S.VA r ;" yLb' L D Y -- IN Vt% rL^v: Mil v Xi v \a A; * K v. ? . ; '
avianinfectiousbronchitisvirusS (1037) ^knTKHELF.Or ! Kf N -- Y ? 7 1 ; ~ % I s ,1 rI^G\ ~ OG&i
bovine coronavirus S (1251) PK?ig? S V A *' I ' h S L DY - - .1 W • TE LQD ~ MN n-L , E A _ K V , 
mouse hepatitis virus S (1207) F K N QT SI .A r * 1, S L DHJE Kjg^ T ' !,TV Mir«1.0A K K , 
Consensus (1281) FKNQTSVAPDLSLDY I NVT FLDLQ EMNRI QEAIKVLN 

Section 34

(1321) 1321 1330 1340 1350 1360

human coronavirus OC43 S (1279) 0 : Y K K D X G T Y 5 V - V " ' > V y v : - ,h C L > GVkMC,V i>r, F F I
avianinfectiousbronchitisvirusS (1075) * t, 1 r "iff I T * I * a ,-;^Y . A A TO • -I-Ll GWV
bovine coronavirus S (1289) Q Y • Kl k DIG T Y y ' 'r, , L C- ' GVAMlv:, FFI
mouse hepatitis virus S (1247) - v k-kev,gty:x v > j, r- r, *-vavcvl r?i 
Consensus (1321) QS YINLKD I GT YEYYVKWPWY VWL LI GLAGVANLVLL FFI 
Section 35

(1361) 1361 1370 1380 1390 1400

human coronavirus OC43 S (1319) CCC l£& T]S CFK " GCC 0 " " T G.YO F :,v I K J 1 

avianinfectiousbronchitisvirusS (1115) FFH ; / ,^cgc ccgc fg 1 M l •': : r. . kk yvt-ti ; tdvv e 

bovine coronavirus S (1329) C-CC i s Cfe — tscfk; < : tghq'e:,v i k: 

mouse hepatitis virus S (1287) G c C ■ ; " G S, CPK,* > * C.n K ^GCyiQ^Si^ 1 H

Consensus (1361) CCCTGCG TSCFKKCGGCCDD x T G H Q E L V I K T 
Section 36

(1401) 1401 1408

human coronavirus OC43 S (1350) Iff! SEQ ID NO: 9918

avianinfectiousbronchitisvirusS (1155) QpRPKKSV SEQ ID NO: 9909

bovine coronavirus S (1360) |HlD SEQ ID NO: 9891

mouse hepatitis virus S (1318) u "is SHED- SEQ ID NO: 9902
Consensus (1401) SHOD 




FIGURE 5

Section 15

(589) 589 600 610 620 630

human coronavirus OC43 S (565)
bovine coronavirus S (575)
mouse hepatitis virus A59 S (524)
Consensus (589) N CTC PQA FLGWS DSCLQGDRCNI FAN F I L H DVN S GTTC S

Section 16

(631) 631 640 650 660 672

human coronavirus OC43 S (607)
bovine coronavirus S (617)
mouse hepatitis virus A59 S (566)
Consensus (631) TDLQKANTDI ILGVCVNYDLYGITGQGI FVEVNATYYNSWQN

Section 17

(673) 673 680 690 700 714

human coronavirus OC43 S (649)
bovine coronavirus S (659)
mouse hepatitis virus A59 S (608)
Consensus (673) LLYDSNGNL YG FRD YTTNRT FMI R SC YSGRVSAAFHANS SE P

Section 18

(715) 715

human coronavirus OC43 S (691)
bovine coronavirus S (701)
mouse hepatitis virus A59 S (650)
Consensus (715) ALLFRN I KCN YVFNNS LSRQLQPIH Y FDS YLGCVVNADNSTA

Section 19

(757) 757 770 798

human coronavirus OC43 S (733) I S V Q 7 r MV*'-3,'V K N * • ] srga :t - - I rrv:: - F • V
bovine coronavirus S (743) SVVQ. , tv;s:y- . * k«- t'S i*.rv> _ r kfumf.:: .vzv
mouse hepatitis virus A59 S (692) Eg&LP RMvA ~L ' S ^ A ? S V S * f <>, Y P
Consensus (757) AVQTCDLTVGSGYC V DYSK RRSRRSITTGYRFTNFEPFTV

Section 20

(799) 799 810 820 830 840

human coronavirus OC43 S (775)
bovine coronavirus S (785)
mouse hepatitis virus A59 S (734)
Consensus (799) NSVNDSLEPVGGLYEIQI P S E FT I G NMEE F I QT S S PK VT I DC

Section 21

(841) 841 850 860 870 882

human coronavirus OC43 S (817) A : " " "YA* "K3 * ' ~ : "D'- I * I ^LE f*l\
bovine coronavirus S (827) ' /A.:.;.;
mouse hepatitis virus A59 S (776) it!
Consensus (841) AAFVCGDYAAC KSQLVE YGS FCDNI NAXLTEVNELLDTTQLQ




FIGURE 6 

FIGURE 6A 





FIGURE 6B 





FIGURE 6C 




.10 






FIGURE 7 






FIGURE 7 A 



SEQ ID NO: 6053 MFVLLVAYALLH 12 

SEQ ID NO: 6057 — MKKLFWLWMPLIYGDNFPCSKLTNRTIGNQWNLIETFLLNYSSRLPPNSDWLGD 60 

SEQ ID NO: 6061 MRSLIYFWLLLPVLPTLSLPQDVTRCQSTTNFRRFFSKFN VQAPAVWLGG 52 

SEQ ID NO: 6065 MFLILLISLPTAFAVIGDLKCTTVS-INDVDTGVPSIS 38 

SEQ ID NO: 6069 MLFVFILFLPSCLGYIGDFRCIQLVNSNGANVSAPSIS 40 

SEQ ID NO: 6042 MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRG 39 

SEQ ID NO: 6072 MLVTPLLLVTLLCALCSAVLYDSSS 27 



SEQ ID NO:6053 

SEQ ID NO: 6057 YFPTVQPWFNCIRNDSNDLYVTLENLKALYWDYATENITWNHRQRLNVWNGYPYSITVT 120 

SEQ ID NO: 6061 YLPSMNSSSWYCGTGIETASGVHGIFLSYIDSGQGFEIGISQEP FDPSGYQLYLH 107 

SEQ ID NO: 6065 TDTVDVTNGLGTYYVLDRVYLNTTLLLNG YY 69 

SEQ ID NO: 6069 TETVEVSQGLGTYYVLDRVYLNATLLLTG YY 71 

SEQ ID NO: 6042 VYYPDEIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFG NP 79 

SEQ ID NO: 6072 YVYYYQSAFRPPSGWHLQG 46 



SEQ ID NO: 6053 IAGCQTTN GLNTSYSVCNG 31 

SEQ ID NO: 6057 TTRNFNSAEGAIICICKGSPPTTTTESSLTCNWGSECRLNHKFPICPSNSEANCGNMLYG 180 

SEQ ID NO: 6061 KATNGNTNAIARLRICQFPDNKTLGPTVNDVTTGRNCLFNKAIPAYMRDGKDIWGITWD 167 

SEQ ID NO: 6065 PTSGSTYRNMALKGTLLLSTLWFKPPFLSDFTNGIFAKVKNTKVIKDGVMYSEFPAITIG 129 

SEQ ID NO: 6069 PVDGSKFRNLALTGTNSVSLSWFQPPYLSQFNDGIFAKVQNLKTSTPSGATAYFPTIVIG 131 

SEQ ID NO: 6042 VIPFKDGIYFAATEKSNVVRGWVFGSTMNNKSQSVIIINNSTNWIRACNFELCDNPFFA 139 

SEQ ID NO: 6072 GAY AWN I S S E FNNAG S S SGCT VG 1 1 HGGRWNAS S I AMT AP 88 



SEQ ID NO: 6053 

SEQ ID NO: 6057 LQWFADEVVAYLHGAS YRI S FENQWSGTVT FGDMRATTLEVAGTLVDLWWFNPVYDVS YY 240 

SEQ ID NO: 6061 NDRVTVFADKIYHFYLKNDWSRVATRCYNRRSCAMQYVYTPTYYMLN 214 

SEQ ID NO: 6065 STFVNTSYSWVQPHTTILGNKLQGFLEISVCQYTMCEYPNT 171 

SEQ ID NO: 6069 SLFGYTSYTWIEPYN GVIMASVCQYTICLLPYT 165 

SEQ ID NO: 6042 VS KPMGTQTH TMI FDNAFN 158 



SEQ ID NO: 6072 



SEQ ID NO: 6053 CVGYSENVFAVESGGYIPSDFAFNNWFLLTNTSSWDGWRSF 74 

SEQ ID NO: 6057 RVNNKNGTTWSNCTDQCASYVANVFTTQPGGFIPSDFSFNNWFLLTNSSTLVSGKLVTK 300 

SEQ ID NO: 6061 VTSAGEDGIYYEPCTANCTGYAANVFATDSNGHIPEGFSFNNWFLLSNDSTLLHGKVVSN 274 

SEQ ID NO: 6065 ICNPN-LGNQRVELWHWDTGWSCLYKRNFTYDVNADYLYFHFYQEGGTFYAYFTDTGW 230 

SEQ ID NO: 6069 DCKPNTNGNKLIGFWHTDVKPPICVLKRNFTLNVNADAFYFHFYQHGGTFYAYYADKPSA 225 

SEQ ID NO: 6042 -CTFEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDWRDLPSGFNTLK 217 

SEQ ID NO: 6072 SSGMAWSSSQFCTAHCNFSDTTVFVTHCYKHGG — CPLTGMLQQN 131 



SEQ ID NO: 6053 QPLLLNCLWSVSGLRFTTGFVYFNGTGRGDCKGFSSDVLSDVIRYNLNFEENLRRG T 131 

SEQ ID NO: 6057 QPLLVNCLWPVPSFEEAASTFCFEGAGFDQCNGAVLNNTVDVIRFNLNFTTNVQSGKGAT 360 

SEQ ID NO: 6061 QPLLVNCLLAIPKIYGLGQFFSFNHTMDGVCNGAAVDRAPEALRFNINDTSVILAEG — S 332 

SEQ ID NO: 6065 TKFLFNVYLGTVLSHYYVMPLTCN SALTLEYWVT PLTSKQYLLAFNQDGVI FNAVD 286 

SEQ ID NO: 6069 TTFLFSVYIGDILTQYYVLPFICNPTAGSTFAPRYWVTPLVKRQYLFNFNQKGVITSAVD 285 

SEQ ID NO: 6042 PIFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVD 277 
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SEQ ID NO: 6072 LIRVSAMKNGQLFYNLTVSVAKYPTFRSFQCVNNLTSVYLNGDLVYTSNETIDVTSAGVY 191 



SEQ ID NO: 6053 ILFKTSYG VWFYCTNNT-LVSGDAHIPFGTVLGNFYCFVNTTIGNETTSAFVGAL 186 

SEQ ID NO: 6057 VFSLNTTGGVTLEISCYTVSDS-SFFSYGEIPFGVTDGPRYCYVHY NGTALKYLGTL 416 

SEQ ID NO: 6061 IVLHTALG TNLSFVCSNSSDPHLAIFAIPLGATEVPYYCFLKVDTYNSTVYKFLAVL 389 

SEQ ID NO: 6065 CKSDFMSEIKCKTLSIAPSTGVYELNGYTVQPIADVYRRIPNLPDCN-IEAWLNDKSVPS 345 

SEQ ID NO: 6069 CASSYTSEIKCKTQSMLPSTGVYELSGYTVQPVGWYRRVANLPACN-IEEWLTARSVPS 344 

SEQ ID NO: 6042 CSQNPIAELKCSVKSFEIDKGIYQTSNFRVVPSGDVVR-FPNITNLCPFGEVFNATKFPS 336 

SEQ ID NO: 6072 FKAGGPITYKVMREVKALAYFVNGTAQDVILCDGSPRGLLACQYNTGNFS DGFYPFTNSS 251 



SEQ ID NO: 6053 PKTVRE FVI SRTGHFYINGYRYFTLGNVEAVN FNVT TAETTDFCTVALASYADVLV 242 

SEQ ID NO: 6057 PPSVKEIAISKWGHFYINGYNFFSTFPIDCISFNLT TGDSDVFWTIAYTSYTEALV 472 

SEQ ID NO: 6061 PPTVREIVITKYGDVYVNGFGYLHLGLLDAVTINFTGHGTDDDVSGFWTIASTNFVDALI 449 

SEQ ID NO: 6065 PLNWERKT FSNCNFNMSSLMS FIQAYS FTCNNI DAA KIYGMCFSSITIDKFAIPNG 401 

SEQ ID NO: 6069 PLNWERKT FQNCNFNLSSLLRYVQAESLFCNNI DAS KVYGRC FGS I S VDKFAVPRS 400 

SEQ ID NO: 6042 VYAWERKKI SNC VAD Y SVL YNS TFFS TFKC YGV SAT KLNDLCFSNVYADSFWKGD 392 

SEQ ID NO: 6072 LVKQKFIVYRENSVNTTCTLHNFIFHNETGANPNPSG — VQNIQTYQTKTAQSGYYNFNF 309 



SEQ ID NO: 6053 NVSQTSIANIIYCNSVINRLRCDQLSFDVPDGFYSTSP — IQSVELPVSIVSLPVYHKHT 300 

SEQ ID NO: 6057 QVENTAITKVTYCNSHVNNIKCSQITANLNNGFYPVSS— SEVGLVNKSVVLLPSFYTHT 530 

SEQ ID NO: 6061 EVQGTSIQRILYCDDPVSQLKCSQVAFDLDDGFYPISSRNLLSHEQPISFVTLPSFNDHS 509 

SEQ ID NO: 6065 RKVDLQLGNLGYLQSFNYRIDTTATSCQLYYNLPAANVS — VSRFNPSTWNRRFGFTEQS 459 

SEQ ID NO: 6069 RQVDLQLGNSGFLQTANYKIDTAATSCQLHYTLPKNNVT — INNHNPSSWNRRYGFNDAG 458 

SEQ ID NO: 6042 DVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNIDAT STGNYNYKYRYLRHG 446 

SEQ ID NO: 6072 SFLSSFVYKESNFMYGSYHPSCKFRLETINNGLWFNSLS VS I AYGPLQGGCKQS 363 



SEQ ID NO: 6053 FIVLYVDFKPQ-SGGGKCFNCYPAGVNITLANFNET— KGPLCVDTSHFTTKYVAVYAN 356 

SEQ ID NO: 6057 IVNITIGLGMKRSGYGQPIASTLS — NITLPMQDHN TDVYCIRSDQFS-VYVHSTCK 584 

SEQ ID NO: 6061 FVNITVS AAFGGLS S ANLVAS DTT ING FS S . FCVDTRQFT IT LFYNVTN 557 

SEQ ID NO: 6065 VFKPQPAGVFTDHDWYAQHCFKASTNFCPCKLDGSLCVGNGPGIDAGYKTSGIGTCPAG 519 

SEQ ID NO: 6069 VFGKN QHDVVYAQQCFTVRSSYCPC 483 

SEQ ID NO: 6042 KLRPFER 453 

SEQ ID NO: 6072 VFKGRAT 370 



SEQ ID NO: 6053 VGRWS ASINTGNCPFSFGKVNNFVKFGSVCFSLKDIPGGCAMPIVA 402 

SEQ ID NO: 6057 SALWDNIFKRNCTDVLDATAVIKTGTCPFSFDKLNNYLTFNKFCLSLSPVGANCKFDVAA 644 

SEQ ID NO: 6061 SYGYVS KSQDSNCPFTLQSVNDYLSFSKFCVSTSLLAGACTIDLFG 603 

SEQ ID NO: 6065 TNYLTCHNAAQCDCLCTPDPITSKATGPYKCPQTKYLVGIGEHCSGLAIKSDHCG G 575 

SEQ ID NO: 6069 AQPDIVSPCTT QTKPKSAFVNVGDHCEGLGVLEDNCGNADPH 525 

SEQ ID NO: 6042 DISNVPFSPDGKPCTPPALNCYWPLND 480 

SEQ ID NO: 6072 CCYAYSYGGPSLCKGVYSGELDHNFECGL 399 



SEQ ID NO: 6053 NWAYSKY YTIG SLYVSWSDGDGITGVPQPVEGVSSFMNVTLDKC 446 

SEQ ID NO: 6057 -RTRTNE QWR SLYVIYEEGDNIVGVPSDNSGVHDLSVLHLDSC 687 

SEQ ID NO: 6061 YPAFGSG VKLT SLYFQFTKGELITGTPKPLEGITDVSFMTLDVC 647 

SEQ ID NO: 6065 NPCTCQP— QAFLGWSVDSCLQGDRCNIFANFILHDVNSGTTCSTDLQKSNTDIILGVC 632 

SEQ ID NO: 6069 KGCICAN— NSFIGWSHDTCLVNDRCQIFANILLNGINSGTTCSTDLQLPNTEWTGIC 582 

SEQ ID NO: 6042 YGFYTTTGIGY QPYRWVLSFELLNAPATVCGPKLSTDLIKNQC 524 

SEQ ID NO: 6072 LVYVTKS GGSRIQTATEPPVITQNNYNNITLNTC 433 
SEQ ID NO: 6053 TKYNIYDVSGVGVIRVSNDTFLN GITYTSTSGNLLGFKDVTKGTIYSITPC 497
SEQ ID NO: 6057 TDYNIYGRTGVGIIRQTNRTLLS GLYYTSLSGDLLGFKNVSDGVIYSVTPC 738
SEQ ID NO: 6061 TKYTIYGFKGEGIITLTNSSILA GVYYTSDSGQLLAFKNVTSGAVYSVTPC 698
SEQ ID NO: 6065 VNYDLYGITGQGIFVEVNATYYNS WQNLLYDSNGNLYGFRDYLTNRTFMIRSC 685
SEQ ID NO: 6069 VKYDLYGITGQGVFKEVKADYYNS WQTLLYDVNGNLNGFRDLTTNKTYTIRSC 635
SEQ ID NO: 6042 VNFNFNGLTGTGVLTPSSKRFQP FQQFGRDVSD FTD SVRD PKTSEILDISPC 576
SEQ ID NO: 6072 VDYNIYGRTGQGFITNVTDSAVSYNYLADAGLAILDTSGSIDIFWQGEYGLNYYKVNPC 493
.:. :.*

SEQ ID NO: 6053 NPPDQLWYQQA--WGAMLSENFTSYGFSNWELPKFFYASNGTYN 542
SEQ ID NO: 6057 DVSAQAAVIDGT--IVGAITSINSELLGLTHWTTTPNFYYYSIYNYTNDRTRGTAIDSND 796
SEQ ID NO: 6061 SFSEQAAYVNDD--IVGVISSLSNS--TFNNTRELPGFFYHSNDGSN 741
SEQ ID NO: 6065 YSGRVSAAFHAN--SSEPALLFRNIKCNYVFNNTLSRQLQPINYFDSYLGCWNADNSTS 743
SEQ ID NO: 6069 YSGRVSAAFHKD--APEPALLYRNINCSYVFSNNISREENPLNYFDSYLGCWNADNRTD 693
SEQ ID NO: 6042 AFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCL 636
SEQ ID NO: 6072 EDVNQQFWSGGK-LVGILTSRNETGSQLLENQFYIKITNGTRRFRRSITEN 544



SEQ ID NO: 6053 -CTDAVLTYSSFGVCADGS IIAVQPRNVSYDSVSAIVTAN LSI 584 

SEQ ID NO: 6057 VDCEPVITYSNIGVCKNGA FVFINVTHSDGD-VQPISTGN VTI 838 

SEQ ID NO: 6061 -CTEPVLVYSNIGVCKSGS IGYVPSQYGQVK-IAPTVTGN ISI 782 

SEQ ID NO: 6065 SVVQTCDLTVGSGYCVDYSTKRRSRRSITTGYRFTNFEPFTVNSVNDSLEPVGGLYEIQI 803 

SEQ ID NO: 6069 EALPNCDLRMGAGLCVDYSKSRRADRSVSTGYRLTTFEPYTPMLVNDSVQSVDGLYEMQI 753 

SEQ ID NO: 6042 IGAEHVDTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAI 696 

SEQ ID NO: 6072 VANCPYVSYGKFCIKPDGS IATIVPKQLEQFVAPLFNVTEN VLI 588 



SEQ ID NO: 6053 PSNWTTSVQVEYLQITSTPIWDCSTYVCNGNVRCVELLKQYTSACKTIEDALRNSARLE 644
SEQ ID NO: 6057 PTNFTISVQVEYIQVYTTPVSIDCSRYVCNGNPRCNKLLTQYVSACQTIEQALAMGARLE 898
SEQ ID NO: 6061 PTNFSMSIRTEYLQLYNTPVSVDCATYVCNGNSRCKQLLTQYTAACKTIESALQLSARLE 842
SEQ ID NO: 6065 PSEFTIGNMEEFIQTSSPKVTIDCSAFVCGDYAACKSQLVEYGSFCDNINAILTEVNELL 863
SEQ ID NO: 6069 PTNFTIGHHEEFIQTRSPKVTIDCAAFVCGDNTACRQQLVEYGSFCVNVNAILNEVNNLL 813
SEQ ID NO: 6042 PTNFSISITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGSFCTQLNRALSGIAAEQ 756
SEQ ID NO: 6072 PNSFNLTVTDEYIQTRMDKVQINCLQYVCGSSLDCRKLFQQYGPVCDNILSWNSVGQKE 648
* . . : . * : : : * : : * . . *.::*.* : :

SEQ ID NO: 6053 S ADVS EMLT FDKKAFT LANVS S FG DYN LSSVIPSLPTSGS 684
SEQ ID NO: 6057 NMEVDSMLFVSENALKLASVEAFNSSETLDPIYKEWPNIGGSWLEGLKYILPSHNS 954
SEQ ID NO: 6061 S VE VNS MLT I S E E ALQLAT I S S FNG DG YNFTNVLGASVYDPASGR 887
SEQ ID NO: 6065 DTTQLQVANSLMNGVTLSTKLKDGVNFN VDDINFSPVLGCLGSDCN 909
SEQ ID NO: 6069 DNMQLQVAS ALMQGVT ISSRLPDGISGP IDDINFSPLLGCIGSTCAEDGN 863
SEQ ID NO: 6042 DRNTREVFAQVKQMYKTPTLKYFGGFN FSQILPDPLKPTK 796
SEQ ID NO: 6072 DMELLNFYSSTKPAGFNTPVLSNVSTG EFNISLLLTNPSSRRK 691

SEQ ID NO: 6053 RVAGRSAIEDILFSKLVTSGLGTVDADYKKCTKGLS--IADLACAQYYNGIMVLPGV 739
SEQ ID NO: 6057 KRKYRSAIEDLLFDKVVTSGLGTVDEDYKRCTGGYD--IADLVCAQYYNGIMVLPGV 1009
SEQ ID NO: 6061 WQKRSVIEDLLFNKVVTNGLGTVDEDYKRCSNGRS--VADLVCAQYYSGVMVLPGV 942
SEQ ID NO: 6065 KVSSRSAIEDLLFSKVKLSDVG-FVEAYNNCTGGAE--IRDLICVQSYNGIKVLPPL 963
SEQ ID NO: 6069 GPSAIRGRSAIEDLLFDKVKLSDVG-FVEAYNNCTGGQE--VRDLLCVQSFNGIKVLPPV 920
SEQ ID NO: 6042 RSFIEDLLFNKVTLADAG-FMKQYGECLGDIN--ARDLICAQKFNGLTVLPPL 846
SEQ ID NO: 6072 RSLIEDLLFTSVESVGLP-TNDAYKNCTAGPLGFFKDLACAREYNGLLVLPPI 743

SEQ ID NO: 6053 ADAERMAMYTGSLIGGIALGGLTS AVSIPFSLAIQARLNYVALQTDVLQENQKILA 795
SEQ ID NO: 6057 ANADKMTMYTASLAGGITLGALGGG AVAIPFAVAVQARLNYVALQTDVLNKNQQILA 1066
SEQ ID NO: 6061 VDAEKLHMYSASLIGGMALGGITA AAALPFSYAVQARLNYLALQTDVLQRNQQLLA 998
SEQ ID NO: 6065 LSENQISGYTLAATSASLFPPWSA AAGVPFYLNVQYRINGIGVTMDVLSQNQKLIA 1019
SEQ ID NO: 6069 LSESQISGYTTGATAAAMFPPWSA AAGVPFSLSVQYRINGLGVTMNVLSENQKMIA 976
SEQ ID NO: 6042 LTDDMIAAYTAALVSGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIA 906
SEQ ID NO: 6072 ITAEMQALYTSSLVASMAFGGITA AGAIPFATQLQARINHLGITQSLLLKNQEKIA 799
*: * :** : *:* :.: .:* .**: :*

SEQ ID NO: 6053 ASFNKAMTNIVDAFTGVNDAITQTSQALQTVATALNKIQDWNQQGNSLNHLTSQLRQNF 855
SEQ ID NO: 6057 SAFNQAIGNITQSFGKVNDAIHQTSRGLATVAKALAKVQDWNIQGQALSHLTVQLQNNF 1126
SEQ ID NO: 6061 ESFNSAIGNITSAFESVKEAISQTSKGLNTVAHALTKVQEWNSQGSALNQLTVQLQHNF 1058
SEQ ID NO: 6065 NAFNNALGAIQEGFDATN SALVKIQAWNANAEALNNLLQQLSNRF 1065
SEQ ID NO: 6069 SAFNNALGAIQDGFDATN SALGKIQSWNANAEALNNLLNQLSNRF 1022
SEQ ID NO: 6042 NQFNKAISQIQESLTTTS TALGKLQDWNQNAQALNTLVKQLSSNF 952
SEQ ID NO: 6072 ASFNKAIGHMQEGFRSTS LALQQIQDWSKQSAILTETMASLNKNF 845
**.*: : ..: ** ::* **. *. *. .* .*

SEQ ID NO: 6053 QAISSSIQAIYDRLDTIQADQQVDRLITGRLAALNVFVSHTLTKYTEVRASRQLAQQKVN 915
SEQ ID NO: 6057 QAISSSISDIYNRLDELSADAQVDRLITGRLTALNAFVSQTLTRQAEVRASRQLAKDKVN 1186
SEQ ID NO: 6061 QAISSSIDDIYSRLDILSADVQVDRLITGRLSALNAFVAQTLTKYTEVQASRKLAQQKVN 1118
SEQ ID NO: 6065 GAISSSLQEILSRLDALEAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAMEKVN 1125
SEQ ID NO: 6069 GAISASLQEILTRLEAVEAKAQIDRLINGRLTALNAYISKQLSDSTLIKVSAAQAIEKVN 1082
SEQ ID NO: 6042 GAISSVIiNDILSRLDKVEAEVQIDRLITGRLQSLQTYWQQLIRAAEIRASANLAATKMS 1012
SEQ ID NO: 6072 GAISSVIQEIYQQFDAIQANAQVDRLITGRLSSLSVLASAKQAEYIRVSQQRELATQKIN 905
***. :> * ... . * b *.******* :*.. : : , * *:.

SEQ ID NO: 6053 ECVKSQSKRYGFCG-NGTHIFSIVNAAPEGLVFLHTVLLPTQYKDVEAWSGLCVDG 970
SEQ ID NO: 6057 ECVRSQSQRFGFCG-NGTHLFSLANAAPNGMIFFHTVLLPTAYETVTAWPGICASDG-DR 1244
SEQ ID NO: 6061 ECVKSQSQRYGFCGGDGEHIFSLVQAAPQGLLFLHTVLVPGDFVNVLAIAGLCVNG 1174
SEQ ID NO: 6065 ECVKSQSSRINFCG-NGNHIISLVQNAPYGLYFIHFSYVPTKYVTAKVSPGLCIAG 1180
SEQ ID NO: 6069 ECVKSQTTRINFCG-NGNHILSLVQNAPYGLYFIHFSYVPISFTTANVSPGLCISG 1137
SEQ ID NO: 6042 ECVLGQSKRVDFCG-KGYHLMSFPQAAPHGWFLHVTYVPSQERNFTTAPAICHEG 1067
SEQ ID NO: 6072 ECVKSQSIRYSFCG-NGRHVLTIPQNAPNGIVFIHFSYTPDSFVNVTAIVGFCVKPANAS 964
*** * . * # * : ** *: *:* * .:*

SEQ ID NO: 6053 TNGYVLRQPNLALYK EGNYYRITSRIMFEPRIPTMADFVQIENCNVT FVNISRS 1024
SEQ ID NO: 6057 TFGLWKDVQLTLFRN LDDKFYLT PRTMYQPRVATS S D FVQI EGC DVL FVNATVS 1299
SEQ ID NO: 6061 EIALTLREPGLVLFTHELQTYTATEYFVSSRRMFEPRKPTVSDFVQIESCWTYVNLTSD 1234
SEQ ID NO: 6065 DRGIAPKSGYFVNVNN TWMFTGSGYYYPEPITGNNVWMSTCAVNYTKAPDV 1232
SEQ ID NO: 6069 DRGLAPKAGYFVQDDG EWKFTGSSYYYPEPITDKNSVIMSSCAVNYTKAPEV 1189
SEQ ID NO: 6042 -KAYFPREGVFVFNGT SWFITQRNFFSPQIITTDNTFVSGNCDWIGIINNT 1118
SEQ ID NO: 6072 QYAIVPANGRGI FIQVN GSYYITARDMYMPRAITAGDWTLTSCQANYVSVNKT 1018

SEQ ID NO: 6053 ELQTIVP-EYIDVNKTLQELSYKL-PNYTVPDLVVEQYNQTILNLTSEISTLENKSAELN 1082
SEQ ID NO: 6057 DLPSIIP-DYIDINQTVQDILENFRPNWTVPELTFDIFNATYLNLTGEIDDLEFRSEKLH 1358
SEQ ID NO: 6061 QLPDVIP-DYIDVNKTLDEILASL-PNRTGPSLPLDVFNATYLNLTGEIADLEQRSESLR 1292
SEQ ID NO: 6065 MLNISTP-NLPYFKEELDQWFKNQTSVAPDLSLDY--INVTFLDLQDEMN 1279
SEQ ID NO: 6069 FLNTSI P-NPPDFKEELDKWFKNQTSIAPDLSLDFEKLNVTLLDLTYEMN 1238
SEQ ID NO: 6042 VYDPLQP-ELDSFKEELDKYFKNHTSPDVDLGDISG-INASWNIQKEID 1166
SEQ ID NO: 6072 VITTFVDNDDFDFNDELSKWWNDTKHELPDFDKFN--YTVPILDIDSEID 1066

SEQ ID NO: 6053 YTVQKLQTLIDNINSTLVDLKWLNRVETYIKWPWWVWLCISWLIFWSMLLLCCCSTGC 1142
SEQ ID NO: 6057 NTTVELAILIDNINNTLVNLEWLNRIETYVKWPWYVWLLIGLVVIFCIPLLLFCCCSTGC 1418
SEQ ID NO: 6061 NTTEELRSLINNINNTLVDLEWLNRVETYIKWPWWVWLIIVIVLIFWSLLVFCCISTGC 1352
SEQ ID NO: 6065 RLQEAIKVLNQSYINLKDIGTYEYYVKWPWYVWLLIGFAGVAMLVLLFFICCCTGC 1335
SEQ ID NO: 6069 RIQDAIKKLNESYINLKEVGTYEMYVKWPWYVWLLIGLAGVAVCVLLFFICCCTGC 1294
SEQ ID NO: 6042 RLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWLGFIAGLIAIVMVTILLCCMTSC 1222
SEQ ID NO: 6072 RIQGVIQGLNDSLIDLEKLSILKTYIKWPWYVWLAIAFATIIFILILGWVFFMTGC 1122
.* . . . *.****.*** ... * *

SEQ ID NO: 6053 CGFFSCFA SSIRGCCESTKLPYYD-VEKIHIQ 1173
SEQ ID NO: 6057 CGCIGCLG SCCHSICSRRQFENYEPIEKVHVH 1450
SEQ ID NO: 6061 CGCCGCCG ACFSGCCRGPRLQPYEAFEKVHVQ 1384
SEQ ID NO: 6065 G--TSCFK KCGGCCDDYTGHQELVIKT--SHED- 1364
SEQ ID NO: 6069 G--SCCFK KCGNCCDEYGGHQDSIVIHNISSHED- 1326
SEQ ID NO: 6042 CSCLKGAC SCG- SCCKFDEDDSEPVLKGVKLHYT- 1255
SEQ ID NO: 6072 CGCCCGCFGIMPLMSKCGKKSSYYTTFDNDWTEQYRPKKSV 1164



FIGURE 7B 



SEQ ID NO: 6054 M FLKLVD DHA- L WN VLLWC WL I V I LLVC I T 1 1 KL I KLC FTCHM FCNRT VY 51 

SEQ ID NO: 6062 MLQLVN DNG - LWNVI LWLFVL FFLLI I SIT FVQLVNLCFTCHRLCNSAVY 54 

SEQ ID NO: 6058 MT FPRALTVI DDNG-MVINI I FWFLLI 1 1 LILLS IALLNI IKLCMVCCNLGRTVI I 59 

SEQ ID NO: 6045 MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCNIVNVSLV 52 

SEQ ID NO: 6073 MNLLNKS LEENG - S FLT ALYI I VGFLALYLLGRALQAFVQAADACCL FWYT WW 57 

SEQ ID NO: 6066 MFMADAY FADT VW YVGQ 1 1 F I VAI C LLV 1 1 VWA FLAT FKLC I QLCGMCNT LVL 54 
SEQ ID NO: 6054 GP IKNVYH-IY-QSYMH IDPF PKRVIDF 77
SEQ ID NO: 6062 TP IGRLYR-VY-KSYMR IDPL PSTVIDV 80
SEQ ID NO: 6058 VP AQHAYD-AY-KNFMR IKAYN PDGALLA 86
SEQ ID NO: 6045 KP TVYVYS-RV-KNLNS SEGV PDLLV 76
SEQ ID NO: 6073 IPGAKGTAFVYKYTYGRKLNNPELEAVIVNEFPKNGWNNKNPANFQDAQRDKLYS 112
SEQ ID NO: 6066 SP SIYVFN-RG-RQFYEF YNDVKP PVLDVDDV 84
*



FIGURE 7C 



SEQ ID NO: 6055 MSNDNC TGDIVTHLKNWNF 19
SEQ ID NO: 6063 MSNGSIP VDEVIEHLRNWNF 24
SEQ ID NO: 6059 MKILLILACVIACACGERYCAMKSDTDLSCRNSTASDCESCFNGGDLIWHLANWNF 60
SEQ ID NO: 6067 MSSVT-TPAPVYTWT ADEAIKFLKEWNF 27
SEQ ID NO: 6070 MSSTTQAPEPVYQWT ADEAVQFLKEWNF 33
SEQ ID NO: 6046 MADNGTIT VEELKQLLEQWNL 21
SEQ ID NO: 6074 MPNETNCTLD FEQSVQLFKEYNL 28

SEQ ID NO: 6055 GWNVILTIFIVILQFGHYKYSRLFYGLKMLVLWLLWPLVLALSIFDTWANWDSN-WAFVA 78
SEQ ID NO: 6063 TWNIILTILLWLQYGHYKYSVFLYGVKMAILWILWPLVLALSLFDAWASFQVN-WVFFA 83
SEQ ID NO: 6059 SWSIILIVFITVLQYGRPQFSWFVYGIKMLIMWLLWPWLALTIFNAYSEYQVSRYVMFG 120
SEQ ID NO: 6067 SLGIILLFITVILQFGYTSRSMFVYVIKMVILWLMWPLTIILTIFN--CVYALN-NVYLG 84
SEQ ID NO: 6070 SLGIILLFITIILQFGYTSRSMFIYWKMIILWLMWPLTIVLCIFN--CVYALN-NVYLG 90
SEQ ID NO: 6046 VI GFLFLAWIMLLQFAYSNRNRFLYI IKLVFLWLLWPVTLACFVLA--AVYRIN-WVTGG 78
SEQ ID NO: 6074 FITAFLLFLTIILQYGYATRSKVIYTLKMIVLWCFWPLNIAVGVIS--CTYPPN-TGGLV 85
:: :**:. ...*:*:.:*:**:: :: : 

SEQ ID NO: 6055 FSFFMAVSTLVMWVMYFANSFRLFRRART FWAWNPEVNAITVTTVL-GQTYYQPIQQAPT 137 

SEQ ID NO: 6063 FSILMACITLMLWIMYFVNSIRLWRRTHSWWSFNPETDALLTTSVM-GRQVCIPVLGAPT 142 

SEQ ID NO: 6059 FSIAGAIVTFVLWIMYFVRSIQLYRRTKSWWSFNPETKAILCVSAL-GRSYVLPLEGVPT 179 

SEQ ID NO: 6067 FSIVFTIVAIIMWIVYFVNSIRLFIRTGSWWSFNPETNNLMCIDMK-GRMYVRPIIEDYH 143 

SEQ ID NO: 6070 FSIVFTIVSIVIWIMYFVNSIRLFIRTGSWWSFNPETNNLMCIDMK-GTVYVRPIIEDYH 149 

SEQ ID NO: 6046 IAIAMACIVGLMWLSYFVASFRLFARTRSMWSFNPETNILLNVP 137 

SEQ ID NO: 6074 AAIILTVFACLSFVGYWIQSIRLFKRCRSWWSFNPESNAVGSILLTNGQQCNFAIESVPM 145 



SEQ ID NO: 6055 GITVTLLSGVLYVDGHRLASGVQVHNLPEYMTVAVPSTTIIYSRVGRSVNSQNSTG — WV 195 

SEQ ID NO: 6063 GVTLTLLSGTLLVEGYKVATGVQVSQLPNFVTVAKATTTIVYGRVGRSVNASSGTG — WA 200 

SEQ ID NO: 6059 GVTLTLLSGNLYAEGFKIAGGMNIDNLPKYVMVALPSRTIVYTLVGKKLKASSATG — WA 237 

SEQ ID NO: 6067 TLTVTIIRGHLYMQGIKLGTGYSLSDLPAYVTVAKVSHLLTYKRG-FLDKIGDTSG — FA 200 

SEQ ID NO: 6070 TLTAT 1 1 RGHLYMQGVKLGTG FS LS DLPAYVTVAKVSHLCT YKRA- FLDKVDGVSG — FA 206 

SEQ ID NO: 6046 VIGAVIIRGHLRMAGHSLGR-CDIKDLPKEITVATS-RTLSYYKLGASQRVGTDSG — PA 193 

SEQ ID NO: 6074 VLS P 1 1 KNGVLYCEGQWLAK-CE PDHLPKDI FVCT PDRRN I YRMVQKYTG DQSGNKKRFA 204 



SEQ ID NO: 6055 FYVRVKHGDFSAVSSPMSNMTENERLLHFF 225 

SEQ ID NO: 6063 FYVRSKHGDYSAVSNPSAVLTDSEKVLHLV 230 

SEQ ID NO: 6059 YYVKSKAGDYSTEAR-TDNLSEQEKLLHMV 266 

SEQ ID NO: 6067 VYVKSKVGNYRLPSTQKGSGLDTALLRNNI 230 

SEQ ID NO: 6070 VYVKSKVGNYRLPSN-KPSGADTALLR — I 233 

SEQ ID NO: 6046 AYNRYRIGNYKLNTDHAGSNDNIALLVQ — 221 

SEQ ID NO: 6074 T FVYAKQSVDTGELES VATGGS S L YT 230 



FIGURE 7D 



SEQ ID NO: 6056 MATVKWADASE PQRGRQG 18 

SEQ ID NO: 6064 MASVSF QDRGRK 17 

SEQ ID NO: 6060 MANQGQRVSWGDEST KTRGRSNSRGRKN 31 

SEQ ID NO: 6068 MSFTPGKQSSS-RASSGNRSGNGILK WADQSDQSRNVQTRGRR-AQPKQTATSQQPS 55 

SEQ ID NO: 6071 MSFVPGQENAGGRSSSVNRAGNGILKKTTWADQTERGPNNQNRGRR-NQPKQTATTQ-PN 58 

SEQ ID NO: 6051 MSDNGPQ SNQRS APRI TFGGPTD S TDNNQNGGRNGARPKQRRPQGLPN 48 

SEQ ID NO: 6075 MAS G KAAG KT DA PA P V I KLGG P K P PKVGSS 35 



SEQ ID NO: 6056 RIPYSLYSPLLVDSEQPW-KVIPRNLVPINKK-DKNKLIGYWN — VQKRFRTRKGK 70 

SEQ ID NO: 6064 RVPLSLYAPLRVTNDKPLSKVLANNAVPTNKG-NKDQQIGYWN — EQIRWRMRRGE 70 

SEQ ID NO: 6060 NNIPLSFFNPITLQQGSKFWNLCPRDFVPKGIG-NRDQQIGYWN — RQTRYRMVKGQ 85 

SEQ ID NO: 6068 GGNWPYYSWFSGITQFQKGKEFEFAEGQGVPIAPGVPATEAKGYWYRHNRRSFKTADGN 115 

SEQ ID NO: 6071 SGSWPHYSWFSGITQFQKGKEFQFAEGQGVPIANGIPASEQKGYWYRHNRRSFKTPDGQ 118 

SEQ ID NO: 6051 NT ASWFTALTQHGK-EELRFPRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDGK 101 

SEQ ID NO: 6075 GNASWFQAIKAKKLNTPPPKFEGSGVPDNENIKPSQQHGYWR — RQARFKPGKGG 88 



SEQ ID NO: 6056 RVDLSPKLHFYYLGTGPHKDAKFRERVEGWWVAVDGAKTEPTGYGVRRKNSEPEIPHFN 130 
SEQ ID NO: 6064 RIEQPSNWHFYYLGTGPHGDLRYRTRTEGVFWVAKEGAKTEPTNLGVRKASEKPIIPKFS 130 
SEQ ID NO: 6060 RKELPERWFFYYLGTGPHADAKFKDKLDGWWVAKDGAMNKPTTLGSRGANNESKALKFD 145 
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SEQ ID NO: 6068 QRQLLPRWYFYYLGTGPHAKDQYGTDIDGVYWVASNQADVNTPADILDRDPSSD — EAIP 173 

SEQ ID NO: 6071 QKQLLPRWYFYYLGTGPHAGASYGDSIEGVFWVANSQADTNTRSDIVERDPSSH — EAIP 176 

SEQ ID NO: 6051 MKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNN — AATV 159 

SEQ ID NO: 6075 RKPVPDAWYFYYTGTGPMDLNWGDTQDGIVWVAAKGADTKSRSNQGTRDPDKF — DQYP 146 

******* . . * . *** ^ * 

SEQ ID NO: 6056 QKLPNGVTVVEE PDSRAPSRSQSR SQSRGRGESK 164 

SEQ ID NO: 6064 QQLPSVVEIVEPNTPPASRANSRSRSRGNGNNRSRSPSNNRGNNQSRGNSQNRGNNQGRG 190 

SEQ ID NO: 6060 GKVPGEFQLEVN QSRDNSRSRSQ SRSRSRNR 176 

SEQ ID NO: 6068 TRFPPGTVLPQGYYIEGS-GRSAPNSR STSRASSRASS 210 

SEQ ID NO: 6071 TRFAPGTVLPQGFYVEGS-GRSAPASR SGSRSQSRGP 212 

SEQ ID NO: 6051 LQLPQGTTLPKGFYAEGSRGGSQASSR SSSRSRGNSR 196 

SEQ ID NO: 6075 LRFSDGGPDGNFRWDFIPLNRGRSGRS TAASSAAASR 183 



SEQ ID NO: 6056 PQSRNPSSDRNHN SQDDIMKAVAAALKSLGFDKPQEKDKKS 205 

SEQ ID NO: 6064 ASQNRGGNNNNNNKSRNQSNNRNQSNDRGGVTSRDDLVAAVKDALKSLGIGENPDRHKQ- 249 

SEQ ID NO: 6060 SQSRGRQQFNNKK DDSVEQAVLAALKKLGVDTEKQQQRS- 215 

SEQ ID NO: 6068 AGSRSRANSGNRT PTSGVT PDMADQI ASLVLAKLGKDAAKP 251 

SEQ ID NO: 6071 NNRARSSSNQRQ PASTVKPDMAEEIAALVLAKLGKDAGQP 252 

SEQ ID NO: 6051 NSTPGSSRGNS P ARMAS GGGETALALLLLDRLNQLE S KV 235 

SEQ ID NO: 6075 APSREGSRGRR SDSGDDLIARAAKI IQDQ 212 



SEQ ID NO: 6056 AKTGTPKPSRNQSPASSQTSAKSLARSQSSETKEQKHEMQKPRWKRQPNDDVTSNVTQCF 265
SEQ ID NO: 6064 QQKPKQEKSDNSGKNTPK KNKSRATSKERDLKDIPEWRRIPKG--ENSVAACF 300
SEQ ID NO: 6060 RSKSKERSNSKTRDTTPK NENKHTWKRTAGK GDVTRFY 253
SEQ ID NO: 6068 QQVT KQT AKE I RQKI LN KPRQKRS PNK--QCTVQQCF 286
SEQ ID NO: 6071 KQVTKQSAKEVRQKILN KPRQKRTPNK--QCPVQQCF 287
SEQ ID NO: 6051 SGKGQQQQGQTVTKKSAAEASK KPRQKRTATK--QYNVTQAF 275
SEQ ID NO: 6075 QKKGSRITKAKADEMAH RRYCKRTIPP--NYRVDQVF 247



SEQ ID NO: 6056 GPRDLDH NFGSAGWANGVKAKGYPQFAELVPSTAAMLFDSHIVSKESG 314 

SEQ ID NO: 6064 GPRGGFK— NFGDAEFVEKGVDASGYAQIASLAPNVAALLFGGNVAVRELA 349 

SEQ ID NO: 6060 GARSSSA NFGDTDLVANGSSAKHYPQLAECVPSVSSILFGSYWTSKEDG 302 

SEQ ID NO: 6068 GKRGPNQ NFGGGEMLKLGTSDPQFPILAELAPTAGAFFFGSRLELAKVQNLSGNLDE 343 

SEQ ID NO: 6071 GKRGPNQ NFGGSEMLKLGTSDPQFPILAELAPTVGAFFFGSKLELVKKN — SGGADE 342 

SEQ ID NO: 6051 GRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTP 327 

SEQ ID NO: 6075 GPRTKGK-EGNFGDDKMNEEGIKDGRVTAMLNLVPSSHACLFGSRVTPKLQL 298 

* * *** * . * . . 



SEQ ID NO: 6056 — NTVVLTFTTRVTVPKDHP HLGKFLEELNAFTR EMQ 349 

SEQ ID NO: 6064 — DSYEITYNYKMTVPKSDP NVELLVSQVDAFKTGNAK-LQRKKEKKNKRETTLQ 401 

SEQ ID NO: 6060 — DQIEVTFTHKYHLPKDDP KTGQFLQQINAYAR PSEVAKEQR 343 

SEQ ID NO: 6068 PQKDVYELRYNGAIRFDSTLSGFETIMKVLNENLNAYQQQD GTMNMSPKPQRQ — R 397 

SEQ ID NO: 6071 PTKDVYELQYSGAVRFDSTLPGFETIMKVLNENLNAYQKDG GADWS PKPQRKGRR 398 

SEQ ID NO: 6051 - - - SGTWLT YHGAI KLDDKD PQFKDNVI LLNKH IDAYKTFP PTE 368 

SEQ ID NO: 6075 DGLHLRFEFTTWPCDDPQFDNYVKICDQCVDGVGTRPKDDEPKPKSRSSSRPATRG 355 



SEQ ID NO: 6056 QHPLLNPSALEFNPSQTSP ATAEPVRDEVS I ETDI I DEVN 389 

SEQ ID NO: 6064 QHEEAIYDDVGAPSDVTHANLEWDTAVDGGDTAVEIINEIFDTGN 446 

SEQ ID NO: 6060 KRKSRS KSAERS EQDWPDALI ENYT DVFDDTQVE 1 1 DEVTN 385 

SEQ ID NO: 6068 GQKNGQGENDNISVAAPKSRVQQNKIRELTAEDISLLKKMDEP FTEDTSEI 448

SEQ ID NO: 6071 QAQEKKDEVDNVSVAKPKSSVQRNVSRELTPEDRSLLAQILDDGWPDGLEDDSNV 454

SEQ ID NO: 6051 -PKKDKKKKTDEAQPLPQRQKKQPTVTLLPAA — 399 

SEQ ID NO: 6075 NSPAPRQQRPKKEKKLKKQDDEADKALTSDEERNNAQLEFYDEPKVINWGDAALGENEL 414 



FIGURE 7E 



FIGURE 7F 



01

FIGURE 7G

TGV
0.1

FIGURE 8

Section 1

(1) 1 10 20 30 42

avian IBV partial 5'UTR 161- (1) TAT T A A A A T C T T ATTGTTGCTGGT AT C AC T G CT T G T T T T G C C

HCoV-OC43 5'UTR (1) 

bovine CV 5'UTR (1) 

Consensus (1) 

Section 2

(43) 43 50 60 70 84

avian IBV partial 5'UTR 161- (43) G T G TCT C AC T T T A T A C AT C T G T T GCT T G G G C T AC C T A GT G T C

HCoV-OC43 5'UTR (1)

bovine CV 5'UTR (1) 

Consensus (43) 

Section 3 

(85) 85 90 100 110 126

avian IBV partial 5'UTR 161- (85) C A G C G T C C T AC G G G CG TCGTGGCT G G T T C GAG T G C G AG G A A C

HCoV-OC43 5'UTR (1)

bovine CV 5'UTR (1)

Consensus (85) 

Section 4 

(127) 127 140 150 168

avian IBV partial 5'UTR 161- (127) CT C TGGTTC A T CTAG C G G T AG G C G G G T GT G TGGA A G T A G C A C

HCoV-OC43 5'UTR (1) GATTS TG ;.GCGATT f !'GCGT } G p GTGC A- -|cc£gC

bovine CV 5'UTR (1) GATTGCGAGC G A T T T N G C G TG'. GTGCA- -TOCCGO 

Consensus (127) GAT TG GAGCG AT TTGCGT GCGTGC A TCCCGC 

Section 5

(169) 169 180 190 200 210

avian IBV partial 5'UTR 161- (169) TT Ci\G AC GTACpGG 'I*':* . T.GT T ( f ll G T G A? A A <fc A — CGGGGS C] A C 

HCoV-OC43 5'UTR (33) T T C A - CT G A 7 C %C T T G T TAG A T C .TT TTTGT AATG.T A

bovine CV 5'UTR (33) TTC A - - - '§T& AT C T CT *; C T T AG AT C T T T T C A T A AT GT A 

Consensus (169) T TCA C T GAT C T C T TGT T AG A T CT T T T C G T AATCTA

Section 6

(211) 211 220 230 240 252 

avian IBV partial 5'UTR 161- (209) CT^CCCCCACATACCTCTAAGGGCTTTTG AGCCT AGGGT7GG 
HCoV-OC43 5'UTR (68) A ACT T TAT A AAA AC A T C C AC T C C C TGT A A T C T AT G C T T GTGci
bovine CV 5'UTR (68) A A£$T T T.AT^A^AAj^AT-;gCA;C;T CCgTGigAGTCTATGC C T GT G G, 
Consensus (211) A AC T T T A T A A A A AC AT CC AC T C C CT G T A G T C T A T G CCTG T G G 

Section 7 

(253) 253 260 270 280 294 

avian IBV partial 5'UTR 161- (251) GCT AC c 
HCoV-OC43 5'UTR (110) GCGTAGJ
bovine CV 5'UTR (110) jfrcc T AC

Consensus (253) GCGT AGATTTTTC ATAGTGGTGTCT ATATT CATTTCT GCT 

Section 8

(295) 295 300 310 320 336 

avian IBV partial 5'UTR 161- (293) ,G G T A G T G C C AAA C A A CCCC T G A G 'G T G A C A G G f B T C T G G* T G G T G 

HCoV-OC43 5'UTR (150) G T T AjA C A G^T T T pAG CCACGGAC '<S T GtT T G T A;TjcS||A^G]G C 

bovine CV 5'UTR (150) <£t|S|2a C A GgT T T&$g||OA G G&Mc & l'fe;T T G T A^C^^A^G C 

Consensus (295) G T T A A C A GCT TTC AG CC A G G G AC GTGT TGT AT CCTAGGC 

Section 9

(337) 337 350 360 373

SEQ ID NO: 9910 
SEQ ID NO: 9919 
SEQ ID NO: 9892 

Consensus (337) AGTG GCCCACCC ATAGGTC AC AATG

FIGURE 9



f1:AT ^ g a g c% aJ tw g ! gtg 



SEQ ID NO: 

(136-154 nt) 6021 



F2: GTG^GTG^AT^C ^CTTCA 

C C CC G 



F3: CTTCACf G^TCT%TGT|fGA 

l A C TA 



(152-172 nt) 



(168-195nt) 



6022 



6023 



R1 : AG^ A CCTGT CAC ^TC -^GG ^TG 
G TACAA G CCT C 



R2: AAAf G^TATA^cffc fLjATG 



(307-329nt) 

•t 

(265-288nt) 



R3: Ct^C tttTATG ttAtA *C-Si7rGCCCA (250-274nt) 



6024 
6025 
6026 



AC AC 



AA A T TAC

FIGURE 10



(1)
avian IBV 3'UTR (NC_001451) 27103- (1)
HCoV-OC43 3'UTR partial (1)
bovine CV 3'UTR (1)
Consensus (1)

(37)
avian IBV 3'UTR (NC_001451) 27103- (37)
HCoV-OC43 3'UTR partial (1)
bovine CV 3'UTR (1)
Consensus (37)

(73)
avian IBV 3'UTR (NC_001451) 27103- (73)
HCoV-OC43 3'UTR partial (1)
bovine CV 3'UTR (1)
Consensus (73)

(109)
avian IBV 3'UTR (NC_001451) 27103- (109)
HCoV-OC43 3'UTR partial (1)
bovine CV 3'UTR (1)
Consensus (109)

(145)
avian IBV 3'UTR (NC_001451) 27103- (145)
HCoV-OC43 3'UTR partial (1)
bovine CV 3'UTR (1)
Consensus (145)

(181)
avian IBV 3'UTR (NC_001451) 27103- (179)
HCoV-OC43 3'UTR partial (14)
bovine CV 3'UTR (11)
Consensus (181)

(217)
avian IBV 3'UTR (NC_001451) 27103- (215)
HCoV-OC43 3'UTR partial (48)
bovine CV 3'UTR (45)
Consensus (217)

Section 1

1 10 20 36

GTAACATAATGGACCTGTTGTTTCCTGGTACATTTT 



Section 2 

37 50 60 72

G T T A A A C AC- T A T T T C T GT G C T T T C C T A T C A A T T A T T 



Section 3 

73 80 90 108 

ACAGGCATTGATTGTGATTATGTTCAATACTTAAGC 



Section 4

109 120 130 144

T T CTTT T G GTTGCT T T TTGC T T'A TTGT A T T G T TGCT 



Section 5 

145 150 160 170 180

G T G C TT T T T AT T G T T G T G A T T C T C A T T AGT T?Ec| - - : § 

TAAGA . * IVY CI A AO 

GACAA AJTGAAG 

A GAGA ATGA AC 
Section 6

181 190 200 216

aSgSaP 

c T t a t ~ g t < : ' v ; caJc c t g g-t!g g t a a: c c c c r' ; c J Gck c g 

CTTAT GTC G G CAC C T G GT GGTAA CCCC T C GCAGG 

Section 7 

217 230 240 252

St a g g 5a t jSt a g c t t g^t ajc; : ? a o a t c tctatcgc 



AA AG T € C CG ™ 
AAAG-Tg<5G|-- 
AAAGTCGGG 



ATAAGGCAC TCTCTATCAG

FIGURE 10 (contd.)



Section 8 



(253)
avian IBV 3'UTR (NC_001451) 27103- (251)
HCoV-OC43 3'UTR partial (76)
bovine CV 3'UTR (73)
Consensus (253)

(289)
avian IBV 3'UTR (NC_001451) 27103- (287)
HCoV-OC43 3'UTR partial (107)
bovine CV 3'UTR (104)
Consensus (289)

(325)
avian IBV 3'UTR (NC_001451) 27103- (323)
HCoV-OC43 3'UTR partial (139)
bovine CV 3'UTR (136)
Consensus (325)

(361)
avian IBV 3'UTR (NC_001451) 27103- (358)
HCoV-OC43 3'UTR partial (175)
bovine CV 3'UTR (172)
Consensus (361)

(397)
avian IBV 3'UTR (NC_001451) 27103- (393)
HCoV-OC43 3'UTR partial (206)
bovine CV 3'UTR (203)
Consensus (397)

(433)
avian IBV 3'UTR (NC_001451) 27103- (429)
HCoV-OC43 3'UTR partial (238)
bovine CV 3'UTR (235)
Consensus (433)

(469)
avian IBV 3'UTR (NC_001451) 27103- (462)
HCoV-OC43 3'UTR partial (274)
bovine CV 3'UTR (271)

(505)
avian IBV 3'UTR (NC_001451) 27103- (498)
HCoV-OC43 3'UTR partial (293)
bovine CV 3'UTR (290)
Consensus (505)



253 260 

AgT fill - - fflffl gjjST GCT.G C||Ag|g|A|gA G / 
AATGG A T G T CTT G C T G C T ATA AT A G AT AG A 




289 300 310



A AG G T X A T AGC AG AC T A T AG ATT AATTAGTTG 

Section 10

325 330 340 350 360

Bt t |^PaS|tSaSt t a a g t fillip ^Sa^MtSgS 

?.-•;.-■.*•' - ■ • < * , 1 I ' 

fT.^T-. = T; '— m,rri rp.nr. r'-rr* f- <rrr- r- t-, n a rr» /-*■ rp 7\ m tv' r*-rr\ r* nr. <~r-.r- t\ ^ t>. ;a;tv 



aaagttttgtgtggtaatgtatagtgttggagaaag 

Section 11

361 370 380 396

^AT'AAAGA T G C C ACT GtC G;G Gpr cJ/ic-GCGffiiG T§C 



{TlG-MAKGKCT-- 
•JgjG - AA AGAG T 
TG A A AG ACT 



ig!aa,g a at i c a c G AC a Kg 



TC^CGG^AgTAgT TGC.C^OA^G 
TGCG G A AG T A AT TGCCGAC A AG 

Section 12

397 410 420 432

G A TGGAG GJ?T AG AGC AC [' A G'C A GGC C C AT T AG GGG^j 

Tr ACC AA -V,GG ^AGAGCGAGCAi G TT.AAGTTl 

T G cgc|kG.f G G Agft GgC AGC AT | TTAA.^T.tI 

T GCCC A AGGGG A AG AGC C AGC ACG TTAAGTTA 

Section 13 

433 440 450 468

A _ ACG ' . . ' A T 'i G T - - T 7-A A A T T A A A f Z T^/ 

(A 

oc 

CCA C C A G T A A T T A G T A A A T G A A T G A A G T T A A T T A T 

Section 14

469 480 490 504

.G G CAP AA/G $ A T 5SG T T SSa AgTTT A T A GGCTAGTAl^GA 



G G C'C A AT TG GM G A&T C A C 

g a; cc a at ;r c; gaaga at gag 



Section 15 



505 513 

GTTAGAGCA 



SEQ ID NO: 9911
SEQ ID NO: 9920
SEQ ID NO: 9893

FIGURE 11



F-1 TCTATC-t^tA | GGATGTCT 
AGA T 



(245 ~ 265 nt) 



SEQ ID NO: 

6027 



F-2 TTAGTT J AA — TTT 7 GT 1 T- GT (318 - 339 nt) 
G AG T G G 



6028 



F-3 TAGTGTT | GAG j A | Gt| TAAAGA ( 346 - 368 nt) 



R-1 A^TT^GCCATA^T JAACTT 



(458 - 476 nt) 



6029 
6030 



R-2 ACTAA^^T^G ^T-^T^C — TAA 
AATT A T CT CC C 



R-3T — TC— GC-^T-^C-^C— GCA 
AC C G GG CC G 



( 426 - 448 nt) 



( 375 - 395 nt) 



6031 

6032 
6033 



FIGURE 12 



[Plot: Coils output for unknown; horizontal axis labeled to 1400]



FIGURE 13

[Schematic; surviving labels: 1a, 1b, S, E, M, N, G.O.I.]

Catalytic His



SEQ ID NO: 6569 

1 sgfrkmafps gkvegcmvqv tcgtttlngl wlddtvycpr hvictaedml npnyedllir

61 ksnhsflvqa gnvqlrvigh smqncllrlk vdtsnpktpk ykfvriqpgq tfsvlacyng

121 spsgvyqcam rpnhtikgsf lngscgsvgf nidydcvsfc ymhhmelptg vhagtdlegk

181 fygpfvdrqt aqaagtdtti tlnvlawlya avingdrwfl nrftttlndf nlvamkynye

241 pltqdhvdil gplsaqtgia vldmcaalke llqngmngrt ilgstilede ftpfdvvrqc

301 sgvtfq




Key residues of the substrate site

FIGURE 16



avian IBV nsp2
MHV nsp2
SARS nsp2
BCoV nsp2
Consensus

(52)
avian IBV nsp2 (49)
MHV nsp2 (52)
SARS nsp2 (52)
BCoV nsp2 (52)
Consensus (52)

(103)
avian IBV nsp2 (100)
MHV nsp2 (102)
SARS nsp2 (102)
BCoV nsp2 (102)
Consensus (103)

(154)
avian IBV nsp2 (151)
MHV nsp2 (153)
SARS nsp2 (153)
BCoV nsp2 (153)
Consensus (154)

(205)
avian IBV nsp2 (202)
MHV nsp2 (204)
SARS nsp2 (204)
BCoV nsp2 (204)
Consensus (205)

(256)
avian IBV nsp2 (251)
MHV nsp2 (247)
SARS nsp2 (249)
BCoV nsp2 (247)
Consensus (256)

(307)
avian IBV nsp2 (301)
MHV nsp2 (297)
SARS nsp2 (300)
BCoV nsp2 (297)
Consensus (307)



_29 



,30 



i n . a „ JO 

S G F K K | V S PBS'AV F KC TVS v(y R G WW LWGL W L <3 d|H| YCPR 
SGBlfKMVS ™SKVE:PCXVSVTYGNMi:TI*WGr.VJLODKVYCPR 
5GFRKMAF pJoKVEGcSvQVTCGTTTLNCJ LttLDDgV YC PR 



Section 1 

51 

vj|G - - ■ - K F jjc 
VICSjiDMTD 
ICBaEDMLN 

[SKVEPCXVSVTYGKMTbNGLWLDDKV^CPRyVICSAjDMTN 
SGI VKMVS PSSKVEPCI VSVTYGMMTLNGLWLDDTVYC PRHVICSAADMTN 

Section 2

52 60 70 80 90 102

& 0|jj L N I* A M H H§§ F E ^ f ^fflQ H G N * / B'S R R§§ K G A|| Ljf L Q T A ||a N A E T P K Y 
PDYPitLLCRVTlB^ 

PNYEDLLIRkJ^NHS Pl*VQ AGN ~VQI*Rv||GjSMQNCBt»RIjK VDTSNP8h k PK' Y 
F D Y Tfj§L I- OR V p ' ~ F D & * BM V M S Y QHQG Cjfp> V b V'T X* Q N S S*r P K V

FIGURE 17
FIGURE 18
FIGURE 19
FIGURE 20
FIGURE 21A
FIGURE 23

LPRKSQPTSI SCRSVL-TNFKICVAVARLHA-CTYAV-TI INFTVVDKKRVTRPSSADCL 
RFRPCCSRSSAYLGFVRV- PKGKMESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEE 
ALSEAREHLKNGTCGLVELEKGVLPQLEQPYVFIKRS DALSTNHGHKVVELVAEMDGIQY 
GRSGITLGVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGI DLKSYDLGDELGTDPIEDY 
EQNWNTKHGSGALRELTRELNGGAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSE 
QLDYIESKRGVYCCRDHEHEI AWFTERS DKS YEHQTPFEIKSAKKFDTFKGECPKFVFPL 
NSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTCDFL 
KATCEHCGTENLVI EGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSNIETRL 
RKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDLLEILSRE 
RVNINI VGDFHLNEEVAI I LAS FSASTSAFI DTIKSLDYKSFKTI VESCGNYKVTKGKPV 
KGAWNIGQQRSVLTPLCGFPSQAAGVIRSI FARTLDAANHSI PDLQRAAVTILDGI SEQS 
LRLVDAMVYTSDLLTNSVI IMAYVTGGLVQQTSQWLSNLLGTTVEKLRPI FEWI EAKLSA 
GVEFLKDAWEILKFLI TGVFDIVKGQIQVASDNIKDCVKCFI DVVNKALEMCI DQVTIAG 
AKLRSLNLGEVFIAQSKGLYRQC IRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTSEEVV 
LKNGELEALETPVDSFTNGAI VGTPVCVNGLMLLEI KDKEQYCALSPGLLATNNVFRLKG 
GAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNEKCSVYTVESGTEVTEFACVVA 
EAVVKTLQPVSDLLTNMGI DLDEWSVAT FYLFDDAGEENFSSRMYCS FYPPDEEEE DDAE 
CEEEEI DETCEHEYGTEDDYQGLPLEFGASAETVRVEEEEEEDWLDDTTEQSEIEPEPEP 
TPEEPVNQFTGYLKLTDNVAIKCVDI VKEAQSANPMVI VNAANIHLKHGGGVAGALNKAT 
NGAMQKESDDYIKLNGPLTVGGSCLLSGHNLAKKCLHVVGPNLNAGEDIQLLKAAYENFN 
SQDI LLAPLLSAGI FGAKPLQSLQVCVQTVRTQVYI AVNDKALYEQVVMDYLDNLKPRVE 
APKQEEPPNTEDSKTEEKSVVQKPVDVKPKIKACI DEVTTTLEETKFLTNKLLLFADING 
KLYHDSQNMLRGEDMS FLEKDAPYMVGDVITSGDI TCVVI PSKKAGGTTEMLSRALKKVP 
VDEYITTYPGQGCAGYTLEEAKTALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAH 
AEETRKLMPICMDVRAIMATIQRKYKGIKIQEGI VDYGVRFFFYTSKEPVAS I ITKLNSL 
NEPLVTMPI GYVTHGFNLEEAARCMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHF 
VETVSLAGSYRDWS YSGQRTELGVEFLKRGDKI VYHTLES PVEFHLDGEVLSLDKLKSLL 
SLREVKT IKVFTTVDNTNLHTQLVDMSMT YGQQFGPT YLDGADVTKI KPHVNHEGKTFFV 
LPSDDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFPQVGGLTS IKWADNNCYLSSVL 
LALQQLEVKFNAPALQEAYYRARAGDAANFCALI LAYSNKTVGELGDVRETMTHLLQHAN 
LESAKRVLNVVCKHCGQKTTTLTGVEAVMYMGTLS YDNLKTGVS I PCVCGRDATQYLVQQ 
ESS FVMMSAPPAEYKLQQGTFLCANE YTGNYQCGHYTHITAKETLYRI DGAHLTKMSEYK 
GPVT DVFYKETSYTTTIKPVSYKLDGVT YTEIEPKLDGYYKKDNAYYTEQPI DLVPTQPL 
PNAS FDNFKLTCSNTKFADDLNQMTGFTKPASRELSVTFFPDLNGDVVAI DYRHYSAS FK 
KGAKLLHKPI VWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNS FEVLAVEDTQGMDNLA 
CESQQPTSEEVVENPT IQKEVIECDVKTTEVVGNVILKPSDEGVKVTQELGHEDLMAAYV 
ENTS ITIKKPNELSLALGLKTI ATHGIAAINSVPWSKILAYVKPFLGQAAI TTSNCAKRL 
AQRV FNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTI AKNSVKSVAKLCLDAGINYVKS 
PKFSKLFT I AMWLLLLS I CLGSLI CVTAAFGVLLSNFGAPSYCNGVRELYLNSSNVTTMD 
FCEGS FPCS ICLSGLDSLDSYPALETIQVTI SS YKLDLTILGLAAEWVLAYMLFTKFFYL 
LGLSAIMQVFFGYFASHFI SNSWLMWFI I S I VQMAPVSAMVRMYI FFASFYYI WKSYVHI 
MDGCTSSTCMMCYKRNRATRVECTTI VNGMKRS FYVYANGGRGFCKTHNWNCLNCDT FCT 
GSTFI SDEVARDLSLQFKRPINPTDQSSYI VDSVAVKNGALHLYFDKAGQKTYERHPLSH 
FVNLDNLRANNTKGSLPINVI VFDGKSKCDESASKSASVYYSQLMCQPILLLDQALVSDV 
GDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSELAKGVALDGVLSTFVSAARQG 
VVDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFMLTYNKVENMTPRDLGACI DCNARHIN 
AQVAKSHNVSLIWNVKDYMSLSEQLRKQIRSAAKKNNI PFRLTCATTRQVVNVI TTKI SL 
KGGKI VSTCFKLMLKATLLCVLAALVCYIVMPVHTLS IHDGYTNEI I GYKAI QDGVTRDI 
I STDDCFANKHAGFDAWFSQRGGSYKNDKSCPVVAAI ITREIGFI VPGLPGTVLRAINGD 
FLHFLPRVFSAVGNICYTPSKLIEYSDFATSACVLAAECTI FKDAMGKPVPYCYDTNLLE 
GSIS YSELRPDTRYVLMDGSI IQFPNTYLEGSVRVVTTFDAEYCRHGTCERSEVGICLST 
SGRWVLNNEHYRALSGVFCGVDAMNLI ANI FTPLVQPVGALDVSASVVAGGI I AILVTCA 
AYYFMKFRRVFGEYNHVVAANALLFLMS FTI LCLVPAYSFLPGVYSVFYLYLTFYFTNDV
SFLAHLQWFAMFSPI VPFWITAI YVFCI SLKHCHWFFNNYLRKRVMFNGVTFSTFEEAAL
CTFLLNKEMYLKLRSETLLPLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALND 
FSNSGADVLYQPPQTS ITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVY 
CPRHVICTAEDMLNPNYEDLLIRKSNHS FLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPK 
TPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGS FLNGSCGSVGFNI DYDCV 
SFCYMHHMELPTGVHAGT DLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDR 
WFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDI LGPLSAQTGI AVLDMCAALKELLQNGMN 
GRTILGSTI LEDEFTPFDVVRQCSGVTFQGKFKKI VKGTHHWMLLTFLTSLLI LVQSTQW 
SLFFFVYENAFLPFTLGIMAI AACAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWV 
MRIMTWLELADTSLSGYRLKDCVMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYK 
VYYGNALDQAI SMWALVI SVTSNYSGVVTTIMFLARAI VFVCVEYYPLLFITGNTLQCIM 
LVYCFLGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSS I DAFKL 
NIKLLGIGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILL 
AKDTTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAI ASE FSSLPSYAAYATA 
QEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQiyiYKQARSED 
KRAKVTSAMQTMLFTMLRKLDNDALNNI INNARDGCVPLNI I PLTTAAKLMVVVPDYGTY 
KNTCDGNTFTYASALWEIQQVVDADSKI VQLSEINMDNSPNLAWPLI VTALRANSAVKLQ 
NNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALLSDHQDLKWARFPKSD 
GTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEV 
PANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQES 
FGGASCCLYCRCHI DHPNPKGFCDLKGKYVQI PTTCANDPVGFTLRNTVCTVCGMWKGYG 
CSCDQLREPLMQSADAST FLNGFAV-VQPVLHRAAQALVLMSSTGLLI FTTKKLLVLQSS 
-KLIAVASRRRMRKAI Y-TLTL-LRGILCLTTNMKRLFI TWLKI VQRLLSMTFSSLE— MV 
TWYHI YHVSV-LNTQWLI -SMLYVILMRVI VI H-KKYSSHTI AVMMI I S IRRIGMTS-RI 
LTS YAYMLT-VSVYANHY— RLYNSAMLCVMQAL— AY— H— I I RI LMGTGT I S VI S YK-HQA 
AEFLLWIHITHC — CPSSL--LGHWLLSPI WMLI SQNHLLSGIC-NMILRKRDFVSSTVI LN 

IGTRHTI PI VLTVWMIGVS FI VQTLMCYFLLCFHLQVLDH EKYL-MVFLLLFQLDT I F 

VS-ESYI IRM-TYI ARVS VSRNF-CMLLIQLCMQLLAI YC- INALHAFQ-LH-QTMLLFK 
LSNPVILIKTFMTLLCLKVSLRKEVLLN-NTSSLLRMATLLSVIMTI I VI ICQQCVI SDN 
SYS-LKLLINTLI VTMVAVLMPTK-SLT I WINQLVSHL INGVRLDFIMTQ- VMRI KMH FS 
RILSVMSSLL-LK- ILSMPLVQRIELAP-LVSLSVVL-QI DS FI RNY- SQ- PPLEELLW- 
LEQAS FT VAGI IC-KLFTVM-KLHTLWVGI I QNVTEPCLTCLG-WPLLFLLAN I TLAVTY 
HTVSTG-LTSVRKY-VRWSCVAAHYMLNQVEHHPVMLQLLMLIVSLTFVKLLQPM-MHFF 

QLMVIR-LTSMSAI YNTGSMSVS IEIGMLIMNSWMS FTLTCVNI SP FFLMMPLCAI TV 

TMRLKV LALRTLRQFFI IKIMCSCLRQNVGLRLTLLKDLTNFAHS IQC-LNKEMITCT 

CLTQIHQE Y-AQAVLSMI LSKQMVHL-LKGSCHWLLMLTHLQNILIRSMLMS FTCI YNTL 
ESYMMSLLATCWTCI P-C-LMITPHGTGNLS FMRLCTHHIQSCRL-VLVYCAIHRLHFVA 
VPVLGDHS YVASAAMTMS FQHHTN-CCLLI PM FAMPQVVMSLM-HNC I -EV-AI I ASHI S 
LPLVFHYVLMVRFLVYTKTHV-AVTMSLTSMR-QHVIGLMLAITYLPTLVLRDSSFSQQK 
RSKPLRKHLSCHMVLPLYAKYSLTENCI FHGRLENLDHH-TETMSLL.VTV — LKI VKYRLE 
STPLKKVTMVMLLCTEVLRHTS-MLVI TLC-HLTL-CHLVHLL-CHKSTM— ELLACTQHS 
TSQMS FLAMLQI IKRSACKSTLHSKDHLVLVRVI LPSDLLSI THLLA-CIRHALMQLLMP 
YVKRH-NICP- INVVESYLRVRA-SVLINSK- IQH-NSMFSAL-MHCQKQLLTL-SLMKS 
LWLLIMT-VLSMLDFVQNTTSI LAILLNYQPPAHC-LKAH-NQNILIQCADL-KQ-VQTC 
SLELVAVVLLKLLTL-VL-FMTI S-KHTRI SQLNASKCSTKVLLHMMFHLQSTDLK-AL- 
ENFLHAILLGEKLFLSHLI I HRTL-LQKS- DCLRRLL I HHRVLNMTMS YSHKLLKQHTLV 
MSTASMWLSQGQKLAFCA— CLIEI FMTNCNLQV-KYHVAMWLHYKQKM-LDFLRTVVRSL 
LVFI LHRHLHTSALI -SSRLKDYVLTYQAYQRT-PTVDSSL-WVSK- ITKSMVTLICLSP 

AKKLFVTFVRGLALM-RAVMQLEMLWVLTYLSS-DFLQVLT LYRLVMLTLKITQNSPE 

LMQNLHQVTSLNILYHSCIKACPGM-CVLR-YKCSVIH-KDCQTESCSSFGRMALSLHQ- 
STLSRLDLKERVVCVTNVQLAFLLHQI LMPAGI ILWVLTMSITHL-LMFSSGALRVTFRV 
TMTNIARYMEMHMWLVVMLS-LDV-QSMSALLSALIGLLNTLL-EMN-GLILLAEKYNTW 
L-SLHCLLI S FQFFMTLE IQRLSSVCLRLK-NGSSTMLSHVVTKLTK-RNSSILMLHITI 
NSLMVFVCFGI VTLI VTQPMQLCVGLTQESCQT-TYQAVMVVVCM- I SMHSTLQLS IKVH 
LLI -SNCLS FTILI VLVSLMANK-CRILIMFHSNLLRVLHDAI -VVLFADTMQMSTDSTW
MHII -- FLLDLAYGFTNNLILITCGIHLPGYRV-KMWLIMLLIKDTLMDTPAKHLFPSLI
MLFTQR-MVLMWRSLKIRQHFLLMLHLS FGLSVTLNQCQRLRYSI I WVLI SLLIL-SGTT 
KEKPQHMYLQ-VSAQ-LTLPRNLLRVLVLHLLSCLMVEWKDR-TFLETPViyiVF — QKVQS 
KV-HLQRDQHKLASMESH-LENQ-KHSLTTLRK-TALFNSCLKPTLLRAET-RILSPDHK 

WKLTFSSSLWMNSYSDI SSRAMPSNTS FMEI SVMDNLAVFI A— PSAHKIHHLN-RIL 

SLWTAQ-KI TS-QMRKQVHQNVCVL-LI FYLMTLSR — SHKI CQ- FQKWSRLQLTMLKFH 

SCFGVRMDMLKPSTQNYKQVERGNQVLRCLTCTRCKECFLKSVTFRIMVKMLLYQKE 

MSQS I LNCVNT-IHLL-LYPTT-ELFTLVLALI KELHQVQLCSDNGCQLAHYLS IQILMT 
SSPTHILL-LETVQQYIRLINGTLLLAICMTLGPNM-QKRMTLKKGFSLICVDL— SKN-P 
WVVL-L-R-QSILGMLTFTSLWAISHGGQLLLQM-MHHHRKHF-LGLTILASRRNKLMAI 
PCMLTTFSGGTQILSSCLPIHSLT-ANFLLN-EELL-CLLRRIKSMI -FILFWKKVGLSL 
EKTTELWFQVI FLLTTKRTCLFS YYFLLSLVVVTLTGAPLLMMFKLL I TLN I LHL^GGFT 
ILMKFLDQTLFI-LRI YFFHFILMLQGFILLI I RLATLS YLLRMVFI LLPQRNQMLSVVG 
FLVLP-TTSHSR— LLLTI LLMLLYEHVTLNCVTTLSLLFLNPWVHRHIL— YS IMHLI ALS 
STYLMPFRLMFQKSQVI LNTYESLCLKIKMGFSMFIRAINL-M-FVI YLLVLTL-NLFLS 
CLLVLTLQI LEPFLQPFHLLKTFGARQLQPI LLAI -SQLHLCSSMMKMVQSQMLLI VLKI 
HLLNSNALLRALRLTKEFTRPLI SGLFPQEML-DSLILQTCVLLERFLMLLNSLLSMHGR 
EKKFLIVLLITLCSTTQHFFQPLSAMAFLPLS-MI FASPMSMQILL-SREMM-DK-RQDK 
LVLLLI I I INCQMI SWVVSLLGI LGTLMLLQLVI I I INIGILDMASLGPLRETYLMCLSP 
LMANLAPHLLLI VIGH-MIMVFTPLLALATNLTEL- YFLLNF-MHRPRFVDQNYPLTLLR 
TSVSILILMDSLVLVC-LLLQRDFNHFNNLAVMFLISLIPFEILKHLKY-TFHLALLGV- 
V-LHLEQMLHLKLLFYIKMLTALIMFLQQFMQINSHQLGAYILLETMYSRLKQAVL-ELSM 
STLLMSATFLLELAFVLVTIQFLYYVVLAKNLLWLILCL-VLIVQLLTLITPLLYLLTFQ 
LALLQK—CLFLWLKPP- I VICTSAEILLNVLICFSNMVAFAHN-I VHSQVLLLNRI ATHV 
KCSLKSNKCTKPQL-NILVVLI FHKYYLTL- SQLRGLLLRTCSLI R-HSLMLAS- SNMAN 
A-VI LMLEI SFVRRSSMDLQCCHLCSLiyil -LLPTLLL-LVVLPLLDGHLVLALLFKYLLL 
CKWHIGSMALELPKMFSMRTKNKSPTNLTRRLVKFKNHLQQHQLHWASCKTLLTRMLKH- 
THLLNNLALILVQFQVC-MI S FRDLI KSRRRYKLTG-LQADFKAFKPM-HNN- SGLLKSG 
LLLILLLLKCLSVFLDNQKELTFVERATTLCPSHKQPRMVLSSYMSRMCHPRRGTSPQRQ 
QFVMKAKHTSLVKVFLCLMALLGLLHRGTS FLHK-LLQTIHLSQEI VMSLLASLTTQFMI 
LCNLSLTHSKKSWTSTSKI I HHQML I LAT FQALTLLS STFKKKLTASMRSLKI — MNHSLT 
FKNWENMSNILNGLGMFGSASLLD-LPSSWLQSCFVA-LVVAVASRVHALVVLAASLMRM 
TLSQFSRVSNYITHKRTYGFVYEI FYSWINYCTASKN-QCFSCKYCSCYSNDTATSLTPF 
RMACYWRC I SCCFSERYQNNCAQ-KMAASPL-GLPVHLQFTAAICYHLFTS FACRCRYGG 
A I FVPLCLDI FSTMHQRM-NYYEMLALLEVQIQEPITL-CQLLCLLAHT-L— LLYT I— QC 
HRYNCRY-R— RHFNTKTQRRLPNWWLF—G-ALRC-RLCRCTWLFHRSLLPA— VYTNYYRH 
WY— KCYI LHL— QAC— RPTECANTHNRRLFRSC-SSNGSNL — ADDDY— RAFVSTRK-VRT 

YVL I RFGRNRYVNS RT S FSC FRGI LASHTSHPYCAS I VCVLLQYC-REFSKTNGLRLL 

AC-KSELF-RSS-SSGLNELTI I I I LFGTLTLLIMADNGT I TVEELKQLLEQWNLVI GFL 
FLAWIMLLQFAYSNRNRFLYI IKLVFLWLLWPVTLACFVLAAVYRINWVTGGIAI AMACI 
VGLMWLS YFVASFRLFARTRSMWSFNPETNILLNVPLRGTI VTRPLMESELVIGAVI IRG 
HLRMAGHSLGRCDI KDLPKEITVATSRTLSYYKLGASQRVGTDSGFAAYNRYRIGNYKLN 
TDHAGSNDNI ALLVQ-VTTDVSSC-LPGYNSRDI DYHYEDFQDCYLES-RYNKFNSET I I 
-ASN-EELFGVR — RTYGVRLSIKRT— KLFSS-H-LYLHLAS YI T IRSVLEVRLYY-KNL 
AHQEHTRAIHHFTLLLTINLH-LALAHTLLLLVLTVLDI PISCVQDQFHQNFSSDKRRFN 
KSSTRHFFSLLLL- YF- YFASPLRERQNE-AHFN-LLFVLFSLSAI PCFNNAYYI LVFTR 

NPGSRRTLYQSLNEHETSHCFDLYFSMQLHMHCSTALCI TSCA— RSL— GTTLGVILI A 

LLGFVL-ERFYLFI DGTLWFKHAHLMLLSTVKIQLVVRL-LGVGT FMKVTKLLHLETYLL 
F-INEQIKMSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQGLPNNTAS 
WFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKMKELSPRWYFY 
YLGTGPEASLPYGANKEGI VWVATEGALNTPKDHIGTRNPNNNAATVLQLPQGTTLPKGF 
YAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALLLLDRLNQLE 
SKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDL 
IRQGTDYKHWPQI AQFAPSASAFFGMSRIGMEVTPSGTWLTYHGAIKLDDKDPQFKDNVI 
LLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPTVTLLPAADMDDFSRQLQNSM
SGASADSTQA-TLMMTTQGRWAM-TFSQFRLRYI VYSCAE- I LVTKQHK-V-LTLI SHSN
L-SMCNIREDLKEPPHFHRGHAEYDRGYSE-- C-GELPIWKSPNV-N-F-- CYPHVILIAS

FIGURE 24

TQEKPTNLDLL-ICSLNEL-NLCSCRSAACLVHLRS INNNKFYCR-QETSNSSLFCRLLT 
VSSVLQS I I S I PRFRPGVTER-DGEPCSWCQRENTRPTQFACPSG-RRASAWLRGLCGRG 
PIGGT-TPQKWHLWSSRAGKRRTAPA-TALCVH-TF-CLKHQSRPQGR-AGCRNGRHSVR 
S-RYNTGSTRATCGRNPNCI PQCSSS-ER — GSRWS-LWHRSKVL-LR-RAWH-SH-RL- 
TKLEH-AWQWCTP-THS-AQWRCSHSLCRQQFLWPRWVPS-LHQRFSRTRGQVNVHSFRT 
T-LHRVEERCLLLP-P-A-NCLVH-AL--ELRAPDTLRN-ECQEI-HFQRGMPKVCVSS- 
LKSQSHSTTC-KEKD-GFHGAYTLCVPCCI STGV-QYALVYLDEM-SLR-SFMADVRLSE 
SHL-TLWH-KFSY-RTYYMWVPTY-CCSENAMSCLSRPRDWT-A-CCRLSQPLKH-NSTP 

QGR-D-MFWRLCVCLCWLL ACLLGSSC-C- YWLRPYWHYW-QCGDLE-GSP-DTES-T 

C-H-HCWRFSFE-RGCHHFGI FLCFYKCLY-HYKES-LQVFQNHC-VLR-L-SYQGKARK 
RCLEHWTTEI S FNTTVWFSLTGCWCYQINFCAHT-CSKPLNS- FAKS SCHHT-WYF-TVI 
TSCRRHGLYFRPAHQQCHYYGICNWWSCTTDFSVVV-SFGHYC-KTQAYL-iyiD-GET-CR 
S- ISQGCLGDSQI SHYRCF-HRQGSNTGCFR-HQGLCKMLH-CC-QGTRNVH-SSHYRWR 
KVAITQLR-SLHRSKQGTLPSVYTWQGAAATTHAS-GTKRSNLS-R- FT-HST YL-GGCS 
QER— TRSTRDAR — LHKWS YRRHTSLCKWPHALRD-GQRT I LRI VSWFTGYKQCLSLKRG 
CTN-RCNLWRRYCLGSSRLQECENHI-A — TC-QSA — KVLCLHC- I RYRS Y-VCMCCSR 
GCCEDFTTS F-SPYQHGY-S — VECSYILLI — CW-RKLFI TYVLFLLPSR-GRRGRCRV 
-GRRN — NL-T— VRYRG— LSRSPSGI WCLS-NSSS— GRRRGRLAG— YY-AIRD— ARTRT Y 
T-RTS-SVYWLFKTY-QCCH-MC-HR-GGTKC-SYGDCKCC-HTPETWWWCSRCTQQGNQ 
WCHAKGE — LH-AKWPSYSRRVLFAFWT-SC-EVSACCWT-PKCR-GHPAS-GSI-KFQF 
TGHLTCTI VVSRHI WC— TTSVFTSVRADGSYTGLYCSQ-QSSL-AGCHGLS — PEA-SGS 
T-TRGATKHRRFQN-GEICRTEACRCEAKN-GLH — GYHNTGRN-VS YQ-VTLVC- YQW- 
ALP— FSEHA-R-RYVFP— EGCTLHGR-CYH-W- YHLCCNTLQKGWWHY- DALKSFEESAS 
— VYNHVPWTRMCWLYT— GS— DCS-EMQIC ILCTTFRST-C— GRDSRNC I LEFERNACSC 
-RDKKINAYMHGC— SHNGNHPT- V-RN-NSRGHR-LWC P ILLLY — RACS FYYYEAELSK 
-AACHNANWLCDTWF-S-RGCALYAFS-SSCRSVSIITRCCYYI-WIPHFVIKDI-GALC 
RNSFFGWLLQRLVLFRTAYRVRC-IS-AW-QNCVPHSGEPRRVSS-R-GSFT-QTKESLI 
PAGG- DYKSVHNCGQH— S PHTACGYVYDIWTAVWSNILGWC-CYKN-TSCKS-G-DFLCT 

T HTT S FRVLPYS EFSW-VHVCFKPHKEMEI SSSWWFN FN— MG — QLLFV-CFI 

STSTA-SQIQCTSTSRGLL-SPCW-CC-LLCTHTRLQ — NCWRAW- CQRNY DPS STAC - F 
GICKASS-CGV-TLWSENYYLNGCRSCDVYGYSIL — S-DRCFHSMCVWS-CYTISSTTR 
VFFCYDVCTTC-V-ITARYILMCE-VHW-LSVWSLHSYNC-GDPLSY-RSSPYKDVRVQR 
TSD-CFLQGNILHYNHQACVV-TRWSYLHRD-TKIGWVL-KG-CLLYRAAYRPCTNSTIT 

KCEF — FQTHMF-HKIC FKSNDRLHKAS FTRAICHI LPRLEWRCSGY-L-TLFSEFQE 

RC- I TA-ANCLAH- PGYNQDNVQTKHLVFTLSLE YKASRYFKFI -SSGSRRHTRNGQSCL 
-KSTTHL— RSSGKS YHTEGSHRV-RENYRSCRQCHT-T IR-RC— SNTRVRS-GS YGCLCG 

KHKHYH-ET AFTSLRFKNNCHSWYCCN CSLE-N FGLCQT I LRTS SNYNI KLR-E I S 

TTCV-QLYALCVYI I VPI VYFY-KYQF-N- S FTT YNYC-K-C-ECC- IMFGCRH-LCEVT 
QI F- I VHNRYVAIVVKYLLRFSNLCNCCFWCTLI -FWCSFLL-WR- 
L-RFFSLQHLFKWIRLP- FLSSS-NHSGDDFI VQARLDNFRSGR-VGFGI YVVHKI 
RSFS YNAGVLWLFC-SFHQQFLAHVVYH- YCTNGTRFCNG—DVHLLCFFLLHMEELCSYH 
GWLHLFDLHDVL-AQSCHTR-VYNYC-WHEEI FLCLCKWRPWLLQDSQLELSQL-H I LHW 

-YIH SCS-FVTPV-KTNQPY-PVIVYC — CCCEKWRAS PLL-QGWSKDL-ETS ALPF 

CQFRQFES-QH-RFTAY-CHSF-WQVQMRRVCF~VCFCVLQSADVPTYSVA-PSSCIRRW 
R-Y-SFR-DV-CLCRHLFSNF-CSYGKT-GTCCYSSQRVSKGCSFRWCPFYIRVSCPTRC 

C-YRC-HKGCY-MSQTFTSL-LRSDR-QL-QFHAHL G-KHDAQRSWRMY-L-CKAYQC 

PSSKKSQCFTHLECKRLHVFI -TAA-TNS-CCQEEQHTF-TNLCYN-TGCQCHNY-NLTQ 
GW-DC-YLF-TYA-GHIIVRSCCIGLLYRYASTYIVNP-WLHK-NHWLQSHSGWCHS-HH 
FY — LFCK-TCWF-RMV-PAWWFIQK-QKLPCSSCYHYKRDWFHSAWLTGYCAESNQW-L 
LAFSTSCF-CCWQHLLHTFQTH-V — FCYLCLRSCC- VYNF-GCYGQTCAI LL-H- FARG 
FYFL — ASSRHSLCAYGWFHHTVS-HLPGGFC-SSNNF-C-VL-TWYMRKVRSRYLPIYQ 

W-MGS ALQSSIRSFLWC-CDESHS-HLYSSCATCGCFRCVCFSSGWWYYCHIGDLCC 

LLLYEIQTCFW-VQPCCCC-CTFVFDVFHYTLSGTSLQLSAGSLLSLLLVLDILFHQ-CF
ILGSPSMVCHVFSYCAFLDNSNLCILYFSEALPLVL-QLS-EKSHV-WSYI-YLRGGCFV
YLFAQQGNVPKI A-RDTVATYTV-QVSCS I-QVQVFQWSLRYYQLS-SSLLPLSKGSK-L 
-QLRC-CSLPTTTDINHFCCSAEWF-ENGI PVRQS-RVHGTSNLWNYNS-WI VVG-HSIL 
SKTCHLHSRRHA-S-L-RSAHSQIQP-LSCSGWQCSTSCYWPFYAKLSA-A-S-YF-P-D 
TQV-ICPYPTWSNIFSSSMLQWFTIWCLSVCHET-SYH-RFFP-WIMW-CWF-H-L-LRV 
FLLYASYGASNRSTRWY-LRR- ILWSIC-QTNCTGCRYRHNHNIKCFGMAVCCCYQW — V 
VS — IHHYFE-L-PCGNEVQL-TFDTRSC-HIGTS FCSNRNCRLRYVCCFERAAAEWYEW 
SYYPW-HYFRR-VYTI -CC-TMLWCYLPR-VQENC-GHSSLDAFNFLDI TI DSCSKYTVV 
TVFLCLRECFLAI YSWYYGNCCMCYAAC-A-ARILVLVSVTFSCNSCLL-YGLHAC-LGD 
AYHDMA- IG-H-LVWL-A-GLCYVCFSFS FAYSHDSSHCL — CC-TCLDTDECHYTCLQS 
LLW-CFRSSYFHVGLSYFCNL-LFWCRYDYHVFS-SYSVCVC-VLPIVIYYWQHLTVYHA 
CLLFLRLLLLLLLWPFLFTQPLLQAYSWCL-LLGLYTRI-VYELPGAFAS-E-Y-CFQA- 
H-VVGYWR-TMYQGCYCTV-NV-RKVHICGTALGSSTT-SRVI F- I VGTMCTTPQ- YSSC 
KRHN-SFREDGFSFVCFAIHAGCCRH — VVRGNAR- PCYS SGYC FRI - FFT I I CRLCHCP 
GGL-AGCS-W-F-SRSQKVKEI FECG-I -V-P-CCHATQVGKDGRSGYDPNVQTGKI -GQ 

EGKSN-CYANNALHYA-EA CT-QHYQQCA-WLCSTQHHT I DYS SQTHGCCP-LWYLQ 

EHL-W-HLYICICTLGNPASC-CG-QDCST — N-HGQFTKFGLASYCYSSKSQLSC-TTE 
— TESSSTTTDVLCGWYHTNSLY — QCTCLL-QFEGR-VCAGITIRPPRSQMG-I P-E-W 
YRYNLHRTGTTL-VCYRHTKRA— SEILVLHQRLKQPK— RYGAGQFSC YSTSSGWKCYRST 
CQFNCAFLLCFCSRPC-S I -GLPSKWRTTNHQLCEDVVYTHWYRTGNYCNTRS-HGPRVL 

WWCFMLSVL— MPH- PSKS-RIL— LER-VRPNT YHLC PSGFYT— KHSLYRLRNVERLWL 

-L-PTPRTLDAVCGCINVFKRVCGVSAARLTPCGTGTSTDVVYRAFDI YNEKVAG FAKFL 
KTNCCRFQEKDEEGNLLDS YFVVKRHTMSNYQHEETI YNLVKDCPAVAVHDFFKFRVDGD 
MVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENP 
DILRVYANLGERVRQSLLKTVQFCDAMRDAGIVGVLTLDNQDLNGNWYDFGDFVQVAPGC 
GVPI VDS YYSLLMPI LTLTRALAAESHMDADLAKPLI KWDLLKYDFTEERLCLFDRYFKY 
WDQTYHPNCINCLDDRCI LHCAN FNVLFSTVFPPTSFGPLVRKI FVDGVPFVVSTGYHFR 
ELGVVHNQDVNLHSSRLS FKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQT 
VKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAI SDYDYYRYNLPTMCDIRQL 
LFVVEVVDKYFDCYDGGC INANQVI VNNLDKSAGFPFNKWGKARLYYDSMS YEDQDALFA 
YTKRNVI PT I TQMNLKYAI SAKNRARTVAGVS I CSTMTNRQFHQKLLKS I AATRGATVVI 
GTSKFYGGWHNMLKTVYS DVETPHLMGWDYPKCDRAMPNMLRIMASLVLARKHNTCCNLS 
HRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLS 
TDGNKI ADKYVRNLQHRLYECLYRNRDVDHEFVDEFYAYLRKHFSMMILS DDAVVCYNSN 
YAAQGLVAS IKNFKAVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYL 
PYPDPSRI LGAGCFVDDI VKTDGTLMIERFVSLAI DAYPLTKHPNQEYADVFHLYLQYIR 
KLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCG 
ACIRRPFLCCKCCYDHVI STSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMS YYCKSHKP 
PI SFPLCANGQVFGLYKNTCVGSDNVTDFNAI ATCDWTNAGDYI LANTCTERLKLFAAET 
LKATEETFKLS YGI ATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGE 
YTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLN 
I SDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARI VYTACSHAAVDAL 
CEKALKYLPI DKCSRI I PARARVECFDKFKVNSTLEQYVFCTVNALPETTADI VVFDE I S 
MATNYDLSVVNARLRAKHYVYI GDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMF 
LGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVR 
EFLTRNPAWRKAVFI SPYNSQNAVASKI LGLPTQTVDSSQGSEYDYVI FTQTTETAHSCN 
VNRFNVAITRAKIGI LCIMSDRDLYDKLQFTSLEI PRRNVATLQAENVTGLFKDCSKI IT 
GLHPTQAPTHLSVDIKFKTEGLCVDI PG I PKDMTYRRLI SMMGFKMN YQVNGYPNMFI TR 
EEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGVNLVAVPTGYVDTENNTEFTRV 
NAKPPPGDQFKHLI PLMYKGLPWNVVRIKIVQMLSDTLKGLS DRVVFVLWAHGFELTSMK 
YFVKIGPERTCCLCDKRATCFSTSSDTYACWNHSVGFDYVYNPFMI DVQQWGFTGNLQSN 
HDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWSVEYPI IGDELRVNSACRKVQHMV 
VKSALLADKFPVLHDIGNPKAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDK 
FTDGVCLFWNCNVDRYPANAI VCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAF 
TNLKQLPFFYYSDS PCESHGKQVVSDI DYVPLKSATC I TRCNLGGAVCRHHANE YRQYLD
AYNMMI SAGFSLWI YKQFDTYNLWNTFTRLQSLENVAYNVVNKGHFDGHAGEAPVS I INN
AVYTKVDGI DVEI FENKTTLPVNVAFELWAKRNIKPVPEIKI LNNLGVDIAANTVI WDYK 
REAPAHVST IGVCTMTDI AKKPTESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVK 
GLTPSKGPAQASVNGVTLIGESVKTQFNYFKKVDGI IQQLPETYFTQSRDLEDFKPRSQM 
ETDFLELAMDEFIQRYKLEGYAFEHI VYGDFSHGQLGGLHLMIGLAKRSQDS PLKLEDFI 
PMDSTVKNYFITDAQTGSSKCVCSVI DLLLDDFVEI I KSQDLSVI SKVVKVT I DYAEI SF 
MLWCKDGHVETFYPKLQASRAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVI PKGIMMN 
VAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDF 
VSDAYSTLIGDCATVHTANKWDLI I SDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLAL 
GGSI AVKITEHSWNADLYKLMGHFSWWTAFVTNVNASSSEAFLIGAN YLGKPKEQI DGYT 
MHANYI FWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMI YSLLEKGRLI IR 
ENNRVVVSSDILVNN-TNMFI FLLFLTLTSGS DLDRCTT FDDVQAPN YTQHT SSMRGVYY 
PDEI FRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVI PFKDGI YFAATEKSNVVRGWV 
FGSTMNNKSQSVI I INNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMI FDNAFNCTFE 
YISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPI DVVRDLPSGFNTLKPI FKL 
PLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNP 
LAELKCSVKS FE I DKGI YQTSNFRVVPSGDVVRFPNI TNLCPFGEVFNATKFPSVYAWER 
KKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQT 
GVIADYNYKLPDDFMGCVLAWNTRNI DATSTGNYNYKYRYLRHGKLRPFERDI SNVPFSP 
DGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPATVCGPKLSTDLIKN 
QCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTSE ILDI SPCAFGGVS 
VITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRI YSTGNNVFQTQAGCLIGAEHV 
DTSYECDI P IGAGICASYHTVSLLRSTSQKS I VAYTMSLGADSSI AYSNNT I AI PTNFS I 
SITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGS FCTQLNRALSGI AAEQDRNTRE. 
VFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGEC 
LGDINARDLICAQKFNGLTVLPPLLTDDMI AAYTAALVSGTATAGWT FGAGAALQI PFAM 
QMAYRFNGIGVTQNVLYENQKQI ANQFNKAI SQI QESLTTTSTALGKLQDVVNQNAQALN 
TLVKQLSSNFGAI SSVLNDILSRLDKVEAEVQI DRLITGRLQSLQTYVTQQLIRAAEIRA 
SANLAATKMSECVLGQSKRVDFCGKGYHLMS FPQAAPHGVVFLHVTYVPSQERNFTTAPA 
ICHEGKAYFPREGVFVFNGTSWFI TQRNFFSPQI ITTDNTFVSGNCDVVIGI INNTVYDP 
LQPELDS FKEELDKYFKNHTSPDVDLGDI SGINASVVNIQKEI DRLNEVAKNLNESLI DL 
QELGKYEQYIKWPWYVWLGFIAGLIAIVMVT ILLCCMTSCCSCLKGACSCGSCCKFDEDD 
SEPVLKGVKLHYT-TNLWICL-DFLLLDQLLHSQ-KLTMLLLQVLFMLQQRYRYKPHSLS 
DGLLLALHFLLFFRALPK — LRS I KDGS — PFIRASSS FAI YCC YLLPS IHI FCLSLQVWRR 
NFCTSMP- YI FYNASTHVELL- DVGFVGSANPRTHYFMMPTTLFAGTHI TMTTVYHITVS 
QIQLSLLKVTAFQHQNSKKTTKLVVILRIGTQVLKTMSLYMAI SPKFTTSLSLHKLLQTL 
VLKMLHSSSLTSLLKTHRMCKYTQSTALQELLIQQWIQFMMSRRRLLACLCKHKKVSTNL 

CTHS FRKKQVR LI AYFFFLLSWYSC-SH- PSLLRFDCVRTAAI LLT- V — NQRFTSTR 

VLKI -TLLKEFLI FWSKRTNYYYYSVWNFNI AYHGRQRYYYR-GA-TTPGTMEPSNRFPI 
PSLDYVTTICLF-SEQVFVHNKACFPLALVASNTCLFCACCCLQN-LGDWRDCDCNGLYC 
RLDVA-LLRCFLQAVCSYPLNVVIQPRNKHSSQCASPGDNCDQTAHGK-TCHWCCDHSWS 
LANGRTLPRAL-H-GPAKRDHCGYI TNAFLLQIRSVAACRH- FRFCC IQPLPYWKL- I KY 

RPRR-QRQYCFASTVSDNRCFI LLTSRLQ-QRY-LSL-GLSGLLFGI LTL — VQ DNYL 

SL-LRRI IRS— MMKNLWS— I IHKTNMKI I LFLTL I VFTSCELYHYQECVRGTTVLLKE PC 
PSGTYEGNS PFHPLADNKFALTCTSTHFAFACADGTRHTYQLRARSVSPKLFIRQEEVQQ 

ELYSPLFLI VAALVFLILCFT IKRKTE— MSSL— LTS ICAF— PFCYSLF CLLYFGFHSK 

SRI -KNLVPKSKRT— NFSLF-LVFLYAVAYAL— YSAVHLINLMCLKILVRYNTRGNTYST 
AWLCALGKVLPFHRWHTMVQTCTPNVTINCQDPAGGALI ARCWYLHEGHQTAAFRDVLVV 

LNKRTN-NV WTPIKPT-CPPHYIWWTHRFN-Q-PEWRTQWGKAKTAPTPRFTQ-YCVL 

VHSSHSAWQGGT- I PSRPGRSNQHQ-WSR- PNWLLPKS YPTS SWW-RQNERAQPQMVLLL 
PRNWPRS FTSLRR-QRRHRMGCN-GSLEYTQRPHWHPQS — QCCHRATTS SRNNI AKRLL 
RRGKQRRQSSLFSLLI T-SR- FKKFNSWQQ-GKFSCSNG-RRW-NCPRAI AARQIEPA-E 
QSFW-RPTTTRPNCH-EICC-GI -KASPKTYCHKTVQRHSS I WETWSRTNPRKFRGPRPN 
QTRN-LQTLAANCTICSKCLCILWNVTHWHGSHTFGNMADLSWSH-IG-QRSTIQRQRHT 
AEQAH-RIQNIPTNRA-KGQKEKD — SSAFAAETKEAAHCDSSSCG-HG- FLQTTSKFHE 
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WSFC- FNSGINTHDDHTRQMGYVNVFAI PFTIHSLLLCRMNSRN-TAQVGLVNFNLT-QS 
LINV-H-GGLERATTFSSRPRGVRSRVQ- IMLGRAAYMEEP-CVKLI LVVLS PCDFNSFL 

FIGURE 25 

PTPRTLDAVCGCINVFKRVCGVSAARLTPCGTGTSTDVVYRAFDIYNEKVAG FAKFLKTNCCRFQEK 
DEEGNLLDSYFVVKRHTMSNYQHEETIYNLVKDCPAVAVHDFFKFRVDGDMVPHISRQRLTKYTMAD 
LVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTVQFC 
DAMRDAGIVGVLTLDNQDLNGNWYDFGDFVQVAPGCGVPIVDSYYSLLMPILTLTRALAAESHMDAD 
LAKPLIKWDLLKYDFTEERLCLFDRYFKYWDQTYHPNCINCLDDRCILHCANFNVLFSTVFPPTSFG 
PLVRKI FVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTT 
CFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLP 
TMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALF 
AYTKRNVI PT I TQMNLKYAI SAKNRARTVAGVS I CSTMTNRQFHQKLLKS I AATRGATVVI GTSKFY 
GGWHNMLKTVYSDVETPHLMGWDYPKCDRAMPNMLRIMASLVLARKHNTCCNLSHRFYRLANECAQV 
LSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYE 
CLYRNRDVDHEFVDEFYAYLRKHFSMMILSDDAVVCYNSNYAAQGLVASIKNFKAVLYYQNNVFMSE 
AKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLA 
IDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTV 
LQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYL 
GGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKL 
FAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYT 
FEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNV 
ANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPI DKCSRI 
I PARARVECFDKFKVNSTLEQYVFCTVNALPETTADI VVFDEISMATNYDLSVVNARLRAKHYVYIG 
DPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDK 
SAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVD 
SSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKIGILCIMSDRDLYDKLQFTSLEI PRRNVATLQ 
AENVTGLFKDCSKIITGLHPTQAPTHLSVDIKFKTEGLCVDI PGI PKDMTYRRLI SMMGFKMNYQVN 
GYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGVNLVAVPTGYVDTENNTEFT 
RVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKI 
GPERTCCLCDKRATCFSTSSDTYACWNHSVGFDYVYNPFMI DVQQWGFTGNLQSNHDQHCQVHGNAH 
VASCDAIMTRCLAVHECFVKRVDWSVEYPI IGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNP 
KAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDKFTDGVCLFWNCNVDRYPANAIVCRFD 
TRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLK
SATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWI YKQFDTYNLWNTFTRLQSLENVAYNV 
VNKGHFDGHAGEAPVSI INNAVYTKVDGI DVEIFENKTTLPVNVAFELWAKRNIKPVPEIKILNNLG 
VDIAANTVIWDYKREAPAHVSTIGVCTMTDIAKKPTESACSSLTVLFDGRVEGQVDLFRNARNGVLI 
TEGSVKGLTPSKGPAQASVNGVTLIGESVKTQFNYFKKVDGI IQQLPETYFTQSRDLEDFKPRSQME 
TDFLELAMDEFIQRYKLEGYAFEHIVYGDFSHGQLGGLHLMIGLAKRSQDSPLKLEDFI PMDSTVKN 
YFITDAQTGSSKCVCSVI DLLLDDFVEI IKSQDLSVISKVVKVTI DYAEISFMLWCKDGHVETFYPK 
LQASRAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLNTLTLAVPYNM 
RVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDAYSTLIGDCATVHTANKWDLI ISDMY 
DPRTKHVTKENDSKEGFFTYLCGFIKQKLALGGSIAVKITEHSWNADLYKLMGHFSWWTAFVTNVNA 
SSSEAFLIGANYLGKPKEQI DGYTMHANYI FWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQ 
INDMIYSLLEKGRLIIRENNRVVVSSDILVNN*TNMFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQ 
HTSSMRGVYYPDEIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPFKDGI YFAATEKSNVVR 
GWVFGSTMNNKSQSVI I INNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYI SD 
AFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNTLKPIFKLPLGINITNFRA
ILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIY 
QTSNFRVVPSGDVVRFPNITNLCPFGEVFNATKFPSVYAWERKKI SNCVADYSVLYNSTFFSTFKCY 
GVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNI DATSTG 
NYNYKYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSF
ELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTS 
EILDISPCAFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGC 
LIGAEHVDTSYECDI PIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAI PTNFSI 
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SITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGSFCTQLNRALSGIAAEQDRNTREVFAQVKQ 
MYKTPTLKYFGGFNFSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQK 
FNGLTVLPPLLTDDMIAAYTAALVSGTATAGWTFGAGAALQI PFAMQMAYRFNGIGVTQNVLYENQK 
QIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVE 
AEVQI DRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQA 
APHGVVFLHVTYVPSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQI ITTDNTFV 
SGNCDVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEI DRLNEV 
AKNLNESLI DLQELGKYEQYIKWPWYVWLGFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKF 
DEDDSEPVLKGVKLHYT * 
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FIGURE 27 



[Chromatogram. Y-axis ticks: -100 to 700 in steps of 100. X-axis ticks: 200, 250, 300, 350 and a final, partly legible tick. Peak labels: flow-through, host cell proteins (HCP); virus peak.]

FIGURE 28 

[Gel image. Two panels, lanes 1 to 6 each. Molecular weight markers: 250, 150, 100, 75, 50, 37, 25 and 15 kDa.]




PP20480.019 




FIGURE 29 
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FIGURE 31 
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FIGURE 33 
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FIGURE 35 
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FIGURE 36C 
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FIGURE 37 
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FIGURE 38 




FIGURE 39 

FIGURE 39A FIGURE 39B 





FIGURE 40 
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FIGURE 41 
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FIGURE 45 
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FIGURE 48 
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FIGURE 50 
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A. 293 cell lysates 
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FIGURE 54 
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FIGURE 57 
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FIGURE 59 
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FIGURE 61 
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FIGURE 63 
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AAGCTTACAAAACAAA 



FIGURE 65 

1 10 
MSDLDRCTTF 
ATG AGT GAC CTT GAC CGG TGC ACC ACT TTT 



20 

DDVQAPNYTQHTSSM 
GAT GAT GTT CAA GCT CCT AAT TAC ACT CAA CAT ACT TCA TCT ATG 

30 40 
RGVYYPDEIFRSDTL
AGG GGG GTT TAC TAT CCT GAT GAA ATT TTT AGA TCA GAC ACT CTT 
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YLTQDLFLPFYSNVT 
TAT TTA ACT CAG GAT TTA TTT CTT CCA TTT TAT TCT AAT GTT ACA 
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GGG TTT CAT ACT ATT AAT CAT ACG TTT GGC AAC CCT GTC ATA CCT 

80 

FKDGIYFAATEKSNV
TTT AAG GAT GGT ATT TAT TTT GCT GCC ACA GAG AAA TCA AAT GTT 
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VRGWVFGSTMNNKSQ 
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SVIIINNSTNVVIRA 
TCG GTG ATT ATT ATT AAC AAT TCT ACT AAT GTT GTT ATA CGA GCA 
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CNFELCDNPFFAVSK 
TGT AAC TTT GAA TTG TGT GAC AAC CCT TTC TTT GCT GTT TCT AAA 
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PMGTQTHTMI FDNA F 
CCC ATG GGT ACA CAG ACA CAT ACT ATG ATA TTC GAT AAT GCA TTT 
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NCTFEYI SDAFSLDV 
AAT TGC ACT TTC GAG TAC ATA TCT GAT GCC TTT TCG CTT GAT GTT 
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TCA GAA AAG TCA GGT AAT TTT AAA CAC TTA CGA GAG TTT GTG TTT 
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KNKDGFLY VYKGYQP 
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I D V V R 
ATA GAT GTA GTT CGT 
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200 

D L P S G 
GAT CTA CCT TCT GGT 



P L G I N 
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TTT TCA CCT GCT CAA 
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ACA GCA ATT CAT GCA GAT CAA CTC ACA CCA GCT TGG CGC ATA TAT 

620

STGNNVFQTQAGCLI 
TCT ACT GGA AAC AAT GTA TTC CAG ACT CAA GCA GGC TGT CTT ATA 

630 640 
GAEHVDTSYECDIPI
GGA GCT GAG CAT GTT GAT ACT TCT TAT GAG TGC GAC ATT CCT ATT 

650 

GAGICASYHT OC SEQ ID NO: 9799

GGA GCT GGC ATT TGT GCT AGT TAC CAT ACA TAA TGAGTCGAC SEQ ID NO: 9800

Translated Mol. Weight = 72525.52
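The pairing of each residue line with the codon line beneath it in Figure 65 follows the standard genetic code. A minimal sketch, assuming the standard table and using "*" for a stop codon (the figure prints the TAA stop as "OC"); the names `CODON_TABLE` and `translate` are illustrative and do not come from the source:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored.

    Stop codons are written as '*', unknown codons as 'X', and a
    trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


# First and last codon rows of Figure 65.
first_row = translate("ATG AGT GAC CTT GAC CGG TGC ACC ACT TTT")
last_row = translate("GGA GCT GGC ATT TGT GCT AGT TAC CAT ACA TAA")
print(first_row)
print(last_row)
```

The same function can be used to spot-check any residue/codon pair in Figures 65 and 66 against the printed translation.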



FIGURE 66 
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DDVQAPNYTQHTSSM 
GAT GAT GTT CAA GCT CCT AAT TAC ACT CAA CAT ACT TCA TCT ATG 

30 40 
RGVYYPDEIFRSDTL
AGG GGG GTT TAC TAT CCT GAT GAA ATT TTT AGA TCA GAC ACT CTT 

50 

YLTQDLFLPFYSNVT 
TAT TTA ACT CAG GAT TTA TTT CTT CCA TTT TAT TCT AAT GTT ACA 

60 70 
GFHTINHTFGNPVIP 
GGG TTT CAT ACT ATT AAT CAT ACG TTT GGC AAC CCT GTC ATA CCT 

80 

FKDGIYFAATEKSNV
TTT AAG GAT GGT ATT TAT TTT GCT GCC ACA GAG AAA TCA AAT GTT 

90 100 
VRGWVFGSTMNNKSQ 
GTC CGT GGT TGG GTT TTT GGT TCT ACC ATG AAC AAC AAG TCA CAG 

110 

SVIIINNSTNVVIRA 
TCG GTG ATT ATT ATT AAC AAT TCT ACT AAT GTT GTT ATA CGA GCA 

120 130 
CNFELCDNPFFAVSK 
TGT AAC TTT GAA TTG TGT GAC AAC CCT TTC TTT GCT GTT TCT AAA 

140 

PMGTQTHTMIFDNAF
CCC ATG GGT ACA CAG ACA CAT ACT ATG ATA TTC GAT AAT GCA TTT 

150 160 
NCTFEYISDAFSLDV






AAT TGC ACT TTC GAG 



S E K S G 
TCA GAA AAG TCA GGT 

180 

K N K D G 
AAA AAT AAA GAT GGG 



I D V V R 
ATA GAT GTA GTT CGT 

210 

P I F K L 
CCT ATT TTT AAG TTG 



A I L T A 
GCC ATT CTT ACA GCC 
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S A A A Y 
TCA GCT GCA GCC TAT 



M L K Y D 
ATG CTC AAG TAT GAT 
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C S Q N P 
TGT TCT CAA AAT CCA 



F E I D K 
TTT GAG ATT GAC AAA 
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V P S G D 
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390 
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TAC ATA TCT GAT GCC 
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N F K H L 
AAT TTT AAA CAC TTA 



F L Y V Y 
TTT CTC TAT GTT TAT 
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D L P S G 
GAT CTA CCT TCT GGT 



P L G I N 
CCT CTT GGT ATT AAC 
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F S P A Q 
TTT TCA CCT GCT CAA 



F V G Y L 
TTT GTT GGC TAT TTA 
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E N G T I 
GAA AAT GGT ACA ATC 



L A E L K 

CTT GCT GAA CTC AAA 

290 

G I Y Q T 
GGA ATT TAC CAG ACC 



V V R F P 
GTT GTG AGA TTC CCT 

320 

V F N A T 
GTT TTT AAT GCT ACT 
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AAA AAA ATT TCT AAT 
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AAG CCA ACT ACA TTT 



T D A V D 
ACA GAT GCT GTT GAT 
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C S V K S 
TGC TCT GTT AAG AGC 



S N F R V 
TCT AAT TTC AGG GTT 

310 

N I T N L 
AAT ATT ACA AAC TTG 



K F P S V 
AAA TTC CCT TCT GTC 

340 

C V A D Y 
TGT GTT GCT GAT TAC 



T F K C Y 
ACC TTT AAG TGC TAT 

370 

C F S N V 
TGC TTC TCC AAT GTC 



D V R Q I 
GAT GTA AGA CAA ATA 

400 
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A P G Q T 
GCG CCA GGG CAA ACT 



P D D F M 
CCA GAT GAT TTC ATG 

420 

I D A T S 
ATT GAT GCT ACT TCA 



L R H G K 
CTT AGA CAT GGC AAG 

450 

V P F S P 
GTG CCT TTC TCC CCT 



N C Y W P 
AAT TGT TAT TGG CCA 

480 

G I G Y Q 
GGC ATT GGC TAC CAA 



L L N A P 
CTT TTA AAT GCA CCG 

510 

D L I K N 
GAC CTT ATT AAG AAC 



T G T G V 
ACT GGT ACT GGT GTG 

540 

F Q Q F G 
TTT CAA CAA TTT GGC 



R D P K T 
CGA GAT CCT AAA ACA 

570 

F G G V S 
TTT GGG GGT GTA AGT 



E V A V L 
GAA GTT GCT GTT CTA 

600 

T A I H A 
ACA GCA ATT CAT GCA 



S T G N N 
TCT ACT GGA AAC AAT 
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G V I A D 
GGT GTT ATT GCT GAT 

410 

G C V L A 
GGT TGT GTC CTT GCT 



T G N Y N 
ACT GGT AAT TAT AAT 

440 

L R P F E 
CTT AGG CCC TTT GAG 



D G K P C 
GAT GGC AAA CCT TGC 

470 

L N D Y G 
TTA AAT GAT TAT GGT 



P Y R V V 
CCT TAC AGA GTT GTA 

500 

A T V C G 
GCC ACG GTT TGT GGA 



Q C V N F 
CAG TGT GTC AAT TTT 

530 

L T P S S 
TTA ACT CCT TCT TCA 



R D V S D 
CGT GAT GTT TCT GAT 

560 

S E I L D 
TCT GAA ATA TTA GAC 



V I T P G 
GTA ATT ACA CCT GGA 

590 

Y Q D V N 
TAT CAA GAT GTT AAC 



D Q L T P 
GAT CAA CTC ACA CCA 

620 

V F Q T Q 
GTA TTC CAG ACT CAA 



Y N Y K L 
TAT AAT TAT AAA TTG 



W N T R N 
TGG AAT ACT AGG AAC 

430 

Y K Y R Y 
TAT AAA TAT AGG TAT 



R D I S N 
AGA GAC ATA TCT AAT 

460 

T P P A L 
ACC CCA CCT GCT CTT 



F Y T T T 
TTT TAC ACC ACT ACT 

490 

V L S F E 
GTA CTT TCT TTT GAA 



P K L S T 
CCA AAA TTA TCC ACT 

520 

N F N G L 
AAT TTT AAT GGA CTC 



K R F Q P 
AAG AGA TTT CAA CCA 

550 

F T D S V 
TTC ACT GAT TCC GTT 



I S P C S 
ATT TCA CCT TGC TCT 

580 

T N A S S 
ACA AAT GCT TCA TCT 



C T D V S 
TGC ACT GAT GTT TCT 

610 

A W R I Y 
GCT TGG CGC ATA TAT 



A G C L I 
GCA GGC TGT CTT ATA 
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630 

G A E H V 
GGA GCT GAG CAT GTC 



G A G I C 
GGA GCT GGC ATT TGT 

660 

S T S Q K 
AGT ACT AGC CAA AAA 



A D S S I 
GCT GAT AGT TCA ATT 

690 

T N F S I 
ACT AAC TTT TCA ATT 



M A K T S 
ATG GCT AAA ACC TCC 

720 

S T E C A 
TCT ACT GAA TGT GCT 



T Q L N R 
ACA CAA CTA AAT CGT 

750 

R N T R E 
CGC AAC ACA CGT GAA 



T P T L K 
ACC CCA ACT TTG AAA 

780 

L P D P L 
TTA CCT GAC CCT CTA 



L L F N K 
TTG CTC TTT AAT AAG 

810 

Q Y G E C 
CAA TAT GGC GAA TGC 



C A Q K F 
TGT GCG CAG AAG TTC 

840 

T D D M I 
ACT GAT GAT ATG ATT 



T A T A G 
ACT GCC ACT GCT GGA 
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D T S Y E 
GAC ACT TCT TAT GAG 

650 

A S Y H T 
GCT AGT TAC CAT ACA 



S I V A Y 
TCT ATT GTG GCT TAT 

680 

A Y S N N 
GCT TAC TCT AAT AAC 



S I T T E 
AGC ATT ACT ACA GAA 

710 

V D C N M 
GTA GAT TGT AAT ATG 



N L L L Q 
AAT TTG CTT CTC CAA 

740 

A L S G I 
GCA CTC TCA GGT ATT 



V F A Q V 
GTG TTC GCT CAA GTC 

770 

Y F G G F 
TAT TTT GGT GGT TTT 



K P T K R 
AAG CCA ACT AAG AGG 

800 

V T L A D 
GTG ACA CTC GCT GAT 



L G D I N 
CTA GGT GAT ATT AAT 

830 

N G L T V 
AAT GGA CTT ACA GTG 



A A Y T A 
GCT GCC TAC ACT GCT 

860 

W T F G A 
TGG ACA TTT GGT GCT 



640 

C D I P I 
TGC GAC ATT CCT ATT 



V S L L R 
GTT TCT TTA TTA CGT 

670 

T M S L G 
ACT ATG TCT TTA GGT 



T I A I P 
ACC ATT GCT ATA CCT 

700 

V M P V S 
GTA ATG CCT GTT TCT 



Y I C G D 
TAC ATC TGC GGA GAT 

730 

Y G S F C 
TAT GGT AGC TTT TGC 



A A E Q D 
GCT GCT GAA CAG GAT 

760 

K Q M Y K 
AAA CAA ATG TAC AAA 



N F S Q I 
AAT TTT TCA CAA ATA 

790 

S F I E D 
TCT TTT ATT GAG GAC 



A G F M K 
GCT GGC TTC ATG AAG 

820 

A R D L I 
GCT AGG GAC CTC ATT 



L P P L L 
TTG CCA CCT CTG CTC 

850 

A L V S G 
GCT CTA GTT AGT GGT 



G A A L Q 
GGC GCT GCT CTT CAA 
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I P F 
ATA CCT TTT 



V T Q 
GTT ACC CAA 



Q F N 
CAA TTT AAC 



T S T 
ACA TCA ACT 



A Q A 
GCT CAA GCA 



G A I 
GGT GCA ATT 



K V E 
AAA GTC GAG 



L Q S 
CTT CAA AGC 



A E I 
GCT GAA ATC 



E C V 
GAG TGT GTT 



G Y H 
GGC TAC CAC 



V F L 
GTC TTC CTA 



T T A 
ACC ACA GCG 



R E G
CGT GAA GGT 



Q R N 
CAG AGG AAC 



F V S 



870 
A M Q 
GCT ATG CAA 



N V L 

AAT GTT CTC 

900 
K A I 

AAG GCG ATT 



A L G 

GCA TTG GGC 

930 

L N T 

TTA AAC ACA 



S S V 

TCA AGT GTG 

960 

A E V 

GCG GAG GTA 



L Q T 

CTT CAA ACC 

990 
R A S 

AGG GCT TCT 



L G Q 
CTT GGA CAA 

1020 
L M S 
CTT ATG TCC 



H V T 
CAT GTC ACG 

1050 
P A I 
CCA GCA ATT 



V F V 
GTT TTT GTG 

1080 
F F S 
TTC TTT TCT 



G N C 



M A Y
ATG GCA TAT 



Y E N
TAT GAG AAC 



S Q I 
AGT CAA ATT 



K L Q 
AAG CTG CAA 



L V K 
CTT GTT AAA 



L N D 
CTA AAT GAT 



Q I D 
CAA ATT GAC 



Y V T 
TAT GTA ACA 



A N L 
GCT AAT CTT 



S K R 
TCA AAA AGA 



F P Q 
TTC CCA CAA 



Y V P 
TAT GTG CCA 



C H E
TGT CAT GAA 



F N G 
TTT AAT GGC 



P Q I 
CCA CAA ATA 



D V V 



R F N 
AGG TTC AAT 

890 

Q K Q 
CAA AAA CAA 



Q E S 
CAA GAA TCA 

920 

D V V 
GAC GTT GTT 



Q L S 

CAA CTT AGC 

950 

I L S 

ATC CTT TCG 



R L I 

AGG TTA ATT 

980 

Q Q L 

CAA CAA CTA 



A A T
GCT GCT ACT 

1010 

V D F 
GTT GAC TTT 



A A P 

GCA GCC CCG 

1040 

S Q E 

TCC CAG GAG 



G K A 

GGC AAA GCA 

1070 

T S W 

ACT TCT TGG 



I T T
ATT ACT ACA 

1100 
I G I 



880 
G I G
GGC ATT GGA 



I A N
ATC GCC AAC 

910 
L T T 
CTT ACA ACA 



N Q N 
AAC CAG AAT 

940 
S N F 
TCT AAT TTT 



R L D 
CGA CTT GAT 

970 
T G R 
ACA GGC AGA 



I R A
ATC AGG GCT 

1000 
K M S 
AAA ATG TCT 



C G K 
TGT GGA AAG 

1030 
H G V 
CAT GGT GTT 



R N F 
AGG AAC TTC 

1060 
Y F P 
TAC TTC CCT 



F I T

TTT ATT ACA 

1090 
D N T 
GAC AAT ACA 



I N N
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TTT GTC TCA GGA AAT TGT GAT GTC GTT ATT GGC ATC ATT AAC AAC 



T V Y D 

ACA GTT TAT GAT

E L D K 

GAG CTG GAC AAG 



F G D I 
TTT GGC GAC ATT 



K E I D 
AAA GAA ATT GAC 



S L I D
TCA CTC ATT GAC 

1183 
K W P OC 
AAA TGG CCT TAA 



1110 

P L Q P 

CCT CTG CAA CCT 

Y F K N 

TAC TTC AAA AAT 

1140 

S G I N 

TCA GGC ATT AAC 

R L N E 

CGC CTC AAT GAG 

1170 

L Q E L 

CTT CAA GAA TTG 



E L D S
GAG CTT GAC TCA 

1130 
H T S P 
CAT ACA TCA CCA 



A S V V 

GCT TCT GTC GTC 

1160 
V A K N 

GTC GCT AAA AAT 



G K Y E 
GGA AAA TAT GAG 



1120 
F K E 
TTC AAA GAA 

D V D
GAT GTT GAT 

1150 
N I Q 
AAC ATT CAA 



L N E 
TTA AAT GAA 

1180 
Q Y I 
CAA TAT ATT 



TGAGTCGAC 
Translated Mol. Weight = 131315.20



SEQ ID NO: 9801 
SEQ ID NO: 9802 
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FIGURE 68 
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FIGURE 73 
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FIGURE 76 
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FIGURE 79 
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FIGURE 81 FIGURE 82 
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FIGURE 84 
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FIGURE 86 
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FIGURE 97 FIGURE 98 FIGURE 99 







FIGURE 100 

[Gel image. Lane labels: Tot, Sol. Band marked at 66 kD.]



FIGURE 101 



[Gel images. Lane labels: Tot, Sol. Molecular weight markers: 97, 64, 28, 19 (first ladder); 98, 62, 49, 38, 28, 19, 14 (second ladder).]



FIGURE 102 

[Gel image. Construct labels: pGex, orf7, A18, pGex. Lane labels: Tot, Sol.]



FIGURE 103 
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FIGURE 106 
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FIGURE 109 
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FIGURE 111 
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FIGURE 113 

5' 3' Frame 1

PKDMTYVDSSL-WVS-ITKSMVTLICLSPAKKLFVTFVRGLALM-RAVMQLEMLWVLTYLS 
S-DFLQVLT--LYRLVMLTLKITQNSPELMHKPPPVSSLNILYHSCIKACPGM-CVLR-YK 
CSVIH-KDCQTESCSSFGRMALSLHQ-STLSRLDLKERVVCVTNVQLAFLLHQILMPAGII 
LWVLTMSITHL-LMFSSGGFTGNLSE-P-PTLPGTWKCTCGLVVML 



5' 3' Frame 2

QRT-PT-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PTS 
PARIFYRC-LSSCTDWLC-H-K-HKIHQS-CTNLHQ-AV-TSYTTHV-RLALECSAY-DST 
NAQ-YTERIVRQSRVRPLGAWL-AYINEVLCQDWT-KNVLSV-QTCNLLFYFIRYLCLLES 
FCGF-LCL-PIYD-CSAVGALRVTFQSNHDQHCQVHGNAHVG-L-C 

5' 3' Frame 3 

KGHDLRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 
QLGFSTGVNLVAVPTGYVDTENNTKFTRVNAQTSTSEQFKHLIPLMYKGLPWNVVRIKIVQ 
MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNH 
SVGFDYVYNPFMIDVQQWGLYG-PFRVTMTNIARYMEMHMWASCDA 

3' 5' Frame 1 

-HHN-PTCAFPCTWQCWSWLL-KVTRKAPTAEHQS-MGYRHSQNPQNDSSRHKYLMK-KSK 
LHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTILSVYH-AFVLS-YALHSRASLY 
T-VV-DV-TAHWWRFVH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-LHDS 
PLHQSQSTHERDE-LLRG--TY-GNH-LGNSF-NPS-R-VYVGHVLW

3' 5' Frame 2 

SITTSPHVHFHVPGNVGHGYSERLPVKPPLLNINHKWVIDIVKTHRMIPAGISI--SRKAS
CTFVTQTTRSFRSNLDKVLH-CKLKAMRPKDEHDSV-QSFQCITEHLYYLNTHYIPGQAFI 
HEWYKMFKLLTGGGLCINSGEFCVIFSVNITSRYSY-VNTCRKS-LER-VSTHSISSCMTA 
LYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDEST-VMSF 

3' 5' Frame 3 

ASQLAHMCISMYLAMLVMVTLKGYP-SPHC-TSIINGL-T-SKPTE-FQQA-VSDEVEKQV 
ARLSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNPFSVSLSICTILIRTTFQGKPLY 
MSGIRCLNCSLVEVCALTLVNFVLFSVST-PVGTATKLTPVENPSWRGRLVPTASLVA-QP 
STSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLRRSCPL 
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FIGURE 114 

5' 3' Frame 1 

YRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGF 
STGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI 

5' 3' Frame 2 

TVDSSL-WVSK-ITKSMVTLICLSPAKKLFVTFVRGLALM-RAVMQLEMLWVLTYLSS-DF 
LQVLT--LYRLVMLTLKITQNSPELMQNLHQVTSLNILY 

5' 3' Frame 3 

P-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PTSPARIF 
YRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYT 

3' 5' Frame 1

GIRCLNWSPGGGFALTLVNSVLFSVST-PVGTATKLTPVENPSWRGRLVPTASLVA-QPST 
SKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLR 

3' 5' Frame 2

V-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-LHDSPLH 
QSQSTHERDE-LLRG--TY-GNH-LGNSF-NPS-R-VYG

3' 5' Frame 3

YKMFKLVTWWRFCINSGEFCVIFSVNITSRYSY-VNTCRKS-LER-VSTHSISSCMTALYI 
KANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTV 
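
The six reading frames shown in Figures 113 and 114 can be reproduced with a short script. This is a sketch under stated assumptions, not the tool used to prepare the figures: it assumes the standard genetic code, prints stop codons as "-" as the figures do, and takes the 3' 5' frames from the reverse complement. The helper names are illustrative. The test fragment is the start of the first nucleotide sequence listed in Figure 121.

```python
from itertools import product

# Standard genetic code in T, C, A, G order; stops written as '-'.
BASES = "TCAG"
AMINO = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frames(dna: str) -> dict:
    """Translate all three forward and three reverse reading frames."""
    seq = "".join(dna.split()).upper()
    rev = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"5' 3' Frame {offset + 1}"] = translate(seq[offset:])
        frames[f"3' 5' Frame {offset + 1}"] = translate(rev[offset:])
    return frames


fragment = "TACCGTAGACTCATCTCTATGATGGGTTTCAAAA"
for name, protein in six_frames(fragment).items():
    print(name, protein)
```

For this fragment the three forward frames begin with the same residues as the three 5' 3' frames of Figure 114.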
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FIGURE 115 
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FIGURE 115 (contd.) 

Section 157 (positions 6085 to 6123)

[Aligned rows at offsets (5906), (5473), (5990) and (1) are not legible.]
(6085) FKDCS KS YSGYHPAHAP S FLAVDDKYKV GDLAVCLNVA

Section 158 (positions 6124 to 6162)

[Aligned rows at offsets (5945), (5512), (6029) and (1) are not legible.]
(6124) D SAVT YSRL I SLMGFKLDVTL DG YCNL F I TRDE A I KRV

Section 159 (positions 6163 to 6201)

[Aligned rows at offsets (5983), (5551), (6067) and (38) are not legible.]
(6163) RAWVG F D VEGAHAT RDSI6$-»X PX.&L.G FSTGI DFVVEPT

Section 160 (positions 6202 to 6240)

[Aligned rows at offsets (6022), (5590), (6106) and (77) are not legible.]
(6202) GLVDTRDGY FKKVNAKAPPGEQFKHLI PLMSRGQPWDV

Section 161 (positions 6241 to 6279)

[Aligned rows at offsets (6061), (5629), (6145) and (116) are not legible.]
(6241) VRPRI VQMLADHL DLSDCVVLVTWAHGFELTCLRYFVK

Section 162 (positions 6280 to 6318)

[Aligned rows at offsets (6100), (5668), (6184) and (155) are not legible.]
(6280) £ G R E r.S-CC V CT-K-R ATG Ft? S'RT G.Y YAC W RM SV GF D YXYlTF
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FIGURE 115 (contd.) 



Section 163 (positions 6319 to 6357)

[Aligned rows at offsets (6139), (5706), (6223) and (194) are not legible.]
(6319) L X V D I Q Q « G Y S G S L, S S M H D L H C SVH KGA K V A IsDSIM <P R

Section 164 (positions 6358 to 6396)

[Aligned rows at offsets (6178), (5745), (6262) and (229) are not legible.]
(6358) CLAVH DC FCN VNWNLEYP 1 1 SNELSVNTS CRLLQRVML

Section 165 (positions 6397 to 6435)

[Aligned rows at offsets (6217), (5784), (6301) and (229) are not legible.]
(6397) KAAMLCNRY VC YDIGNPKG I ACVK FDFKFYDANPI

Section 166 (positions 6436 to 6474)

[Aligned rows at offsets (6254) and (229) are not legible.]
(6436) VKSVKQFLY YEAHKD F DGLCMFWNCNVDKYP NAVV

Section 167 (positions 6475 to 6513)

[Aligned rows at offsets (6293), (5862), (6377) and (229) are not legible.]
(6475) CRFDTRVLN LNLPGCNGGSLYVNKHAFHT PFSRAAFE

Section 168 (positions 6514 to 6552)

[Aligned rows at offsets (6332), (5901), (6416) and (229) are not legible.]
(6514) NLKPMP FF YYS DT PCVYMDGMD AKQVD YVPLKSATCITR
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FIGURE 115 (contd.) 



Section 169 (positions 6553 to 6591)

[Aligned rows are not legible.]
CNLGGAVCLKHAEEYREYLE S YNTATTAGFTFWVYKTFD

Section 170 (positions 6592 to 6630)

[Aligned rows are not legible.]
FYNLWNTFTKLQ S LEN VV YNLVKAGH YDG AGEMPCAII

Section 171 (positions 6631 to 6669)

[Aligned rows are not legible.]
GDKVIAKIQ E DVVVF I NNTTFPTNVAVEL FAKRS I R H

Section 172 (positions 6670 to 6708)

[Aligned rows are not legible.]
PELKL FRNLN I DVCW H V I WDYAKDS I FC S NT YK VC YT

Section 173 (positions 6709 to 6747)

[Aligned rows are not legible.]
DL ID LNVL FDGRDNGALEAFKKA NGVYI STTKIKS

Section 174 (positions 6748 to 6786)

SEQ ID NO: 10068 
SEQ ID NO: 10069 
D SEQ ID NO: 10070 

- SEQ ID NO: 9997/98 

LSMIKGP RADLNGVVVDKVGDSD FWFAVRKDGNDVI SEQ ID NO: 10071 
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FIGURE 116 
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FIGURE 118 

Section 1 (positions 1 to 51)

[Four aligned rows at offset (1) are not legible.]
(1) C S T N LPK D C S K S Y S G Y HPAHAPSFLAVDDKYKVGGDLAVCLNVAD KGHDL

Section 2 (positions 52 to 102)

[Aligned rows at offset (51) are not legible.]
(52) RRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVG

Section 3 (positions 103 to 153)

[Four aligned rows at offset (102) are not legible.]
(103) TNLPLQLGFSTGVNLVAVPTGYVDTENNTKFTRVNAQTSTSEQFKHLIPLM

Section 4 (positions 154 to 204)

[Four aligned rows at offset (153) are not legible.]
(154) YKGLPWNVVRIKIVQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPE

Section 5 (positions 205 to 255)

[Four aligned rows at offset (204) are not legible.]
(205) RTCCLCDKRATCFSTSSDTYACWNHSVGFDYVYNPFMIDVQQWGLYGSLSS

Section 6 (positions 256 to 306)

[Four aligned rows at offset (255) are not legible.]
(256) NHDLHCSVHKGAHVASSDAIMTRCLAVHDCFCNSVNWNLEYPIISNELSVN
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FIGURE 118 (contd.) 

Section 7 (positions 307 to 357)

[Aligned rows at offset (306) are not legible.]
(307) TSCRLLQRVMLKAAMLCNRYT VCYDI GN PKG I ACVK D FDFK FYDAN P I V

Section 8 (positions 358 to 408)

[Aligned rows are not legible.]
(358) KSVKQFLYS YEAHKDS FKDGLCMFWNCNVDKY PANAVVCRFDT RVLN LNL

Section 9 (positions 409 to 459)

[Aligned rows at offsets (408), (406), (406) and (405) are not legible.]
(409) PGCNGGSLYVNKHAFHTKPFSRAAFENLKPMPFFYYSDTPCVYMDGMDAKQ

Section 10 (positions 460 to 510)

[Aligned rows at offsets (458), (457), (457) and (456) are not legible.]
(460) VDYVPLKSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKT

Section 11 (positions 511 to 524)

(509) [not legible] SEQ ID NO: 10073
(508) [not legible] SEQ ID NO: 10074
(508) [not legible] SEQ ID NO: 10075
(507) [not legible] SEQ ID NO: 10076
(511) FDFYNLWNTFTKLQ SEQ ID NO: 10077
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FIGURE 119 

tagtcaaaacccacagaatgattccagcaggcataagtatctgatgaagtagaaaagcaa 

-VSDEVEKQ 

gttgcacgtTTGtcacacagacaacacgttctttcaggtccaatcTTGacaaagtacttc 
V A R _L_ SHRQHVLSGPI _JL_ T K Y F 

attgatgtaagctcaaagccatgcgcccaaaggacgaacacgactctgtctgacaatcct 
IDVSSKPCAQRTNTTLSDNP 

ttcagtgtatcactgagcatttgtactatcttaatacgcactacattccagggcaagcct 
FSVSLSICTILIRTTFQGKP 

ttatacATGagtggtataagatgtttaaactgctcactggtggaggtttgtgcattaact 
L Y _M_ SGIRCLNCSLVEVCALT 

Ctggtgaattttgtgttattttcagtgtcaacataa SEQ ID NO: 10080 
L V N F V L F S V S T - SEQ ID NO: 10027 



FIGURE 120 

FIGURE 120A 

PRHTQRT-PTVDSSL-WVSK-ITKSMVTLICLSPAKKLFVTFVRGLALM-RAVMQL 
EMLWVLTYLSS-DFLQVLT--LYRLVMLTLKITQNSPELMQNLHQVTSLNILYHSC
IKACPGM-CVLR-YKCSVIH-KDCQTESCSSFGRMALSLHQ-STLSRLDLKERVVC 
VTNVQLAFLLHQILMPAGIILWVLTMSITHL-LMFSSGALRVTFRVTMTNIARYME 
MHMWLVVMLS-LDV-QSMSALLSALIGLLNTLL-EMN-GLILLAEKYNTWL-SLHC 
LLISFQFFMT-EIQRLSSVCLRLK-NGSSTMLSHVVTKLTK-RNSSILMLYITINS 
LMVFVCFGIVTLIVTQPMQLCVGLTQESCQT-TYQAVMVVVCM-ISMHSTLQLSIK 
VHLLI-SNCLSFTILIVLVSLMANK-CRILIMFHSNLLRVLHDAI-VVLFADTMQM 
STDSTWMHII--FLLDLAYGFTNNLILITCGIHLPGYRV 

FIGURE 120B 

LGIPKGHDLP-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN- 
RCCGY-PTSPARIFYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYTTHV 
-RLALECSAY-DSTNAQ-YTERIVRQSRVRPLGAWL-AYINEVLCQDWT-KNVLSV 
-QTCNLLFYFIRYLCLLESFCGF-LCL-PIYD-CSAVGLYG-PSE-P-PTLPGTWK 
CTCG-L-CYHD-MFSSP-VLC-AR-LVC-IPYYRR-TEG-FCLQKSTTHGCEVCIA 
C--VSSSS-HRKSKGYQVCASG-SRMEVLRCSAM--QSLQNRGTLLFLCYTSR-IH 
-WCLFVLEL-R-SLPSQCNCV-V-HKSLVKLELTRL-WW-FVCE-ACIPHSSFR-K 
CIY-FKAIAFLLLF--SL-VSWQTSSVGY-LCSTQICYVYYTMQFRWCCLQTPCK- 
VPTVLGCI-YDDFCWI-PMDLQTI-YL-PVEYIYQVTEF 
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FIGURE 120C 

-AYPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAV 
GTNLPLQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWN 
VVRIKIVQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFST
SSDTYACWNHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTR 
CLAVHECFVKRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKA 
IKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANA 
IVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESH 
GKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQF 
DTYNLWNTFTRLQSL 

FIGURE 120D 

-TL-PGKCIPQVISIKLFVNP-AKSSRNHHIICIQVLSVLICMVSANSTT-IASCNTRS 
RFEWNIINIRHYLFAMRLTRTIRIVKERQLL-ISKCTFIESWSVECMLIHIQTTTITAW 
-VQV-QDSCVKPTHNCIGWVTINVTIPKQTNTISEFIVMYSIRIEEFLYFVSFVTTWLS 
IVELPFYFSLRHTLDSLWISYVMKNWKLISKQCRLHNHVLYFSASRINPQFISYNRVFN 
RPINALNKALMDC-TSSHDSITTSHMCISMYLAMLVMVTLKVTRKAPLLNINHKWVIDI 
VKTHRMIPAGISI--SRKASCTFVTQTTRSFRSNLDKVLH-CKLKAMRPKDEHDSV-QS 
FQCITEHLYYLNTHYIPGQAFIHEWYKMFKLVTWWRFCINSGEFCVIFSVNITSRYSY- 
VNTCRKS-LER-VSTHSISSCMTALYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETH 
HRDESTVGHVLWVCL 

FIGURE 120E 

KLCNLVNVFHRL-VSNCL-IHRLNPAEIIILYASKYCRYSFAWCLQTAPPKLHRVIHVA 
DLSGT-SISDTTCLP-DSQGLSE--KKGNCFKLVNALLSKAGVWNACLFTYKLPPSQPG 
KFKFDKTLVSNLHTIALAG-RSTLQFQNKQTPSVNLS-CIA-E-KSSSIL-ALSLHG-A 
S-NFHSTSA-GTHLIAFGFPMS-RTGNLSASNADFTTMCCTFLQAELTLSSSPIIGYST 
DQSTRLTKHSWTAKHLVMIASQLATCAFPCTWQCWSWLL-RLPVKPHC-TSIINGL-T- 
SKPTE-FQQA-VSDEVEKQVARLSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNP 
FSVSLSICTILIRTTFQGKPLYMSGIRCLNWSPGGGFALTLVNSVLFSVST-PVGTATK 
LTPVENPSWRGRLVPTASLVA-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPI 
IEMSLR-VMSFGYA- 

FIGURE 120F 

NSVTW-MYSTGYKYQIVCKSIG-IQQKSSYYMHPSTVGTHLHGVCKQHHLNCIV-YT-Q 
I-VEHNQYPTLLVCHETHKDYQNSKRKAIALN--MHFYRKLECGMHAYSHTNYHHHSLV 
SSSLTRLLCQTYTQLHWLGNDQRYNSKTNKHHQ-IYRDV-HKNRRVPLFCKLCHYMAEH 
RRTSILLQPEAHT--PLDFLCHEELETYQQAMQTSQPCVVLFCKQN-PSVHLL--GIQQ 
TNQRA-QSTHGLLNI-S--HHN-PHVHFHVPGNVGHGYSEGYP-SPTAEHQS-MGYRHS 
QNPQNDSSRHKYLMK-KSKLHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTIL 
SVYH-AFVLS-YALHSRASLYT-VV-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS 
-HL-KILAGEVG-YPQHL-LHDSPLHQSQSTHERDE-LLRG--TY-GNH-LGNSF-NPS 
-R-VYGRSCPLGMPR 

FIGURE 121 



                          10        20        30        40        50        60
                           |         |         |         |         |         |
SEQ ID NO: 10033                            TACCGTAGACTCATCTCTATGATGGGTTTCAAAA
SEQ ID NO: 10084  CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAA
Consensus                                   TACCGTAGACTCATCTCTATGATGGGTTTCAAAA
Prim. cons.       CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAA

                          70        80        90       100       110       120
                           |         |         |         |         |         |
SEQ ID NO: 10033  TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC
SEQ ID NO: 10084  TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC
Consensus         TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC
Prim. cons.       TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC

                         130       140       150       160       170       180
                           |         |         |         |         |         |
SEQ ID NO: 10033  ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG
SEQ ID NO: 10084  ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG
Consensus         ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG
Prim. cons.       ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG

                         190       200       210       220       230       240
                           |         |         |         |         |         |
SEQ ID NO: 10033  GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA
SEQ ID NO: 10084  GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA
Consensus         GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA
Prim. cons.       GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA

                         250       260       270       280       290       300
                           |         |         |         |         |         |
SEQ ID NO: 10033  CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC
SEQ ID NO: 10084  CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC
Consensus         CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC
Prim. cons.       CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC

                         310       320       330       340       350       360
                           |         |         |         |         |         |
SEQ ID NO: 10033  CAGGTGACCAGTTTAAACATCTTATACC
SEQ ID NO: 10084  CAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAG
Consensus         CAGGTGACCAGTTTAAACATCTTATACC
Prim. cons.       CAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAG



etc. 
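
The two summary rows of the alignment above can be reproduced with a short sketch. It rests on an interpretation of the figure rather than on anything the listing states: the Consensus row is assumed to keep a base only where both sequences have one and agree, and the primary-consensus row is assumed to fall back to whichever sequence has a base. The function name `consensus_rows` and the use of `N` for a disagreement are hypothetical.

```python
def consensus_rows(a: str, b: str, gap: str = " "):
    """Return (strict, primary) consensus rows for two pre-aligned sequences.

    strict  : base only where both rows carry the same base
    primary : base from either row; 'N' where the rows disagree (assumption)
    """
    width = max(len(a), len(b))
    a, b = a.ljust(width, gap), b.ljust(width, gap)
    strict, primary = [], []
    for x, y in zip(a, b):
        strict.append(x if x == y and x != gap else gap)
        if x == gap:
            primary.append(y)
        elif y == gap or x == y:
            primary.append(x)
        else:
            primary.append("N")
    return "".join(strict), "".join(primary)


# First row of the figure: SEQ ID NO: 10033 begins 26 positions into 10084.
row_10033 = " " * 26 + "TACCGTAGACTCATCTCTATGATGGGTTTCAAAA"
row_10084 = "CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAA"
strict, primary = consensus_rows(row_10033, row_10084)
print(strict)
print(primary)
```

Under these assumptions the strict row equals the shorter sequence and the primary row equals the longer one wherever the two overlap without mismatch, which is what the figure shows.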
FIGURE 122 

5' 3' Frame 1 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaa 

PRHTQRT-PTVDSSL-WVSK 
tgaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtc 

-ITKSMVTLICLSPAKKLFV 
acgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgg 

TFVRGLALM-RAVMQLEMLW 
gtactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccga 

VLTYLSS-DFLQVLT--LYR 
ctggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccac 

LVMLTLKITQNSPELMQNLH 
caggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtag 

QVTSLNILYHSCIKACPGM- 
tgcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcg 

CVLR-YKCSVIH-KDCQTES 
tgttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattg 

CSSFGRMALSLHQ-STLSRL 
gacctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcag 

DLKERVVCVTNVQLAFLLHQ 
atacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatga 

ILMPAGIILWVLTMSITHL- 
ttgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcc 

LMFSSGALRVTFRVTMTNIA 
aggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcag 

RYMEMHMWLVVMLS-LDV-Q 
tccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatg 

SMSALLSALIGLLNTLL-EM 
aactgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgc 

N-GLILLAEKYNTWL-SLHC 
ttgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgc 

LLISFQFFMT-EIQRLSSVC 
ctcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttaca 

LRLK-NGSSTMLSHVVTKLT 
aaatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgttt 

K-RNSSILMLYITINSLMVF 
gtttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttg 

VCFGIVTLIVTQPMQLCVGL 
acacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtga 

TQESCQT-TYQAVMVVVCM- 
ataagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaat 

ISMHSTLQLSIKVHLLI-SN 
tgcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgg 

CLSFTILIVLVSLMANK-CR 
atattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtg 

ILIMFHSNLLRVLHDAI-VV 
ctgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatga 

LFADTMQMSTDSTWMHII-- 
tttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaata 

FLLDLAYGFTNNLILITCGI 
catttaccaggttacagagttta 

HLPGYRV 



5' 3' Frame 2 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaat 

LGIPKGHDLP-THLYDGFQN 
gaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtca 

ELPSQWLP-YVYHPRRSYSS 
cgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtggg 

RSCVDWL-CRGLSCN-RCCG 
tactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgac 

Y-PTSPARIFYRC-LSSCTD 
tggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacc 

WLC-H-K-HRIHQS-CKTST 
aggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagt 

R-PV-TSYTTHV-RLALECS 
gcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgt 

AY-DSTNAQ-YTERIVRQSR 
gttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgg 

VRPLGAWL-AYINEVLCQDW 
acctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcaga 

T-KNVLSV-QTCNLLFYFIR 
tacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgat 

YLCLLESFCGF-LCL-PIYD 
tgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcca 

-CSAVGLYG-PSE-P-PTLP 
ggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagt 

GTWKCTCG-L-CYHD-MFSS 
ccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatga 

P-VLC-AR-LVC-IPYYRR- 
actgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgct 

TEG-FCLQKSTTHGCEVCIA 
tgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcc 

C--VSSSS-HRKSKGYQVCA 
tcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaa 

SG-SRMEVLRCSAM--QSLQ 
aatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttg 

NRGTLLFLCYTSR-IH-WCL 
tttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttga 

FVLEL-R-SLPSQCNCV-V- 
cacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaa 

HKSLVKLELTRL-WW-FVCE 
taagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaatt 

-ACIPHSSFR-KCIY-FKAI 
gcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgga 

AFLLLF--SL-VSWQTSSVG 
tattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgc 

Y-LCSTQICYVYYTMQFRWC 
tgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgat 



CLQTPCK-VPTVLGCI-YDD 
ttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaatac 

FCWI-PMDLQTI-YL-PVEY 
atttaccaggttacagagttta 

IYQVTEF 



5' 3' Frame 3 

cctaggcatacccaaaggacATGacctaccgtagactcatctctatgatgggtttcaaaatg 

-AYPKDMTYRRLISMMGFKM 
aattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtcac 

NYQVNGYPNMFITREEAIRH 
gttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgggt 

VRAWIGFDVEGCHATRDAVG 
actaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgact 

TNLPLQLGFSTGVNLVAVPT 
ggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacca 

GYVDTENNTEFTRVNAKPPP 
ggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagtg 

GDQFKHLIPLMYKGLPWNVV 
cgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgtg 

RIKIVQMLSDTLKGLSDRVV 
ttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgga 

FVLWAHGFELTSMKYFVKIG 
cctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcagat 

PERTCCLCDKRATCFSTSSD 
acttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgatt 

TYACWNHSVGFDYVYNPFMI 
gatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgccag 

DVQQWGFTGNLQSNHDQHCQ 
gtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagtc 

VHGNAHVASCDAIMTRCLAV 
catgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatgaa 

HECFVKRVDWSVEYPIIGDE 
ctgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgctt 

LRVNSACRKVQHMVVKSALL 
gctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcct 

ADKFPVLHDIGNPKAIKCVP 
caggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaaa 

QAEVEWKFYDAQPCSDKAYK 
atagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttgt 

IEELFYSYAIHHDKFTDGVC 
ttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttgac 

LFWNCNVDRYPANAIVCRFD 
acaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaat 

TRVLSNLNLPGCDGGSLYVN 
aagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaattg 

KHAFHTPAFDKSAFTNLKQL 
cctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcggat 

PFFYYSDSPCESHGKQVVSD 
attgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgct 

IDYVPLKSATCITRCNLGGA 
gtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgatt 

VCRHHANEYRQYLDAYNMMI 
tctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaataca 

SAGFSLWIYKQFDTYNLWNT 
tttaccaggttacagagttta 

FTRLQSL 



3' 5' Frame 1 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaat 

-TL-PGKCIPQVISIKLFVN 
ccataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactc 

P-AKSSRNHHIICIQVLSVL 
atttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcaga 

ICMVSANSTT-IASCNTRSR 
tttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggact 

FEWNIINIRHYLFAMRLTRT 
atcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagc 

IRIVKERQLL-ISKCTFIES 
tggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagtt 

WSVECMLIHIQTTTITAW-V 
caagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatc 

QV-QDSCVKPTHNCIGWVTI 
aacgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcata 

NVTIPKQTNTISEFIVMYSI 
agaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaa 

RIEEFLYFVSFVTTWLSIVE 
cttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatg 

LPFYFSLRHTLDSLWISYVM 
aagaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttct 

KNWKLISKQCRLHNHVLYFS 
gcaagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaac 

ASRINPQFISYNRVFNRPIN 
gcgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagc 

ALNKALMDC-TSSHDSITTS 
cacatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgt 

HMCISMYLAMLVMVTLKVTR 
aaagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacaga 

KAPLLNINHKWVIDIVKTHR 
atgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcaca 

MIPAGISI--SRKASCTFVT 
cagacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaa 

QTTRSFRSNLDKVLH-CKLK 
gccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgag 

AMRPKDEHDSV-QSFQCITE 
catttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtat 

HLYYLNTHYIPGQAFIHEWY 
aagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtt 

KMFKLVTWWRFCINSGEFCV 
attttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcc 

IFSVNITSRYSY-VNTCRKS 
tagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatc 

-LER-VSTHSISSCMTALYI 
aaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagg 

KANPRTNVTNSFFAGDKHIR 
gtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggt 

VTIDLVIHFETHHRDESTVG 
catgtcctttgggtatgcctagg 

HVLWVCL 



3' 5' Frame 2 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatc 

KLCNLVNVFHRL-VSNCL-I 
cataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactca 

HRLNPAEIIILYASKYCRYS 
tttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagat 

FAWCLQTAPPKLHRVIHVAD 
ttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggacta 

LSGT-SISDTTCLP-DSQGL 
tcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagct 

SE--KKGNCFKLVNALLSKA 
ggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttc 

GVWNACLFTYKLPPSQPGKF 
aagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatca 

KFDKTLVSNLHTIALAG-RS 
acgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataa 

TLQFQNKQTPSVNLS-CIA- 
gaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaac 

E-KSSSIL-ALSLHG-AS-N 
ttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatga 

FHSTSA-GTHLIAFGFPMS- 
agaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctg 

RTGNLSASNADFTTMCCTFL 
caagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacg 

QAELTLSSSPIIGYSTDQST 
cgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcc 

RLTKHSWTAKHLVMIASQLA 
acatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgta 

TCAFPCTWQCWSWLL-RLPV 
aagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaa 

KPHC-TSIINGL-T-SKPTE 
tgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacac 

-FQQA-VSDEVEKQVARLSH 
agacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaag 

RQHVLSGPILTKYFIDVSSK 
ccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagc 

PCAQRTNTTLSDNPFSVSLS 
atttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtata 

ICTILIRTTFQGKPLYMSGI 
agatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtta 

RCLNWSPGGGFALTLVNSVL 
ttttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcct 

FSVST-PVGTATKLTPVENP 
agctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatca 

SWRGRLVPTASLVA-QPSTS 
aagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattaggg 

KPIHART-RIASSRVINILG 
taaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtc 

-PLTW-FILKPIIEMSLR-V 
atgtcctttgggtatgcctagg 

MSFGYA- 



3' 5' Frame 3 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatcc 

NSVTW-MYSTGYKYQIVCKS 
ataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactcat 

IG-IQQKSSYYMHPSTVGTH 
ttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagatt 

LHGVCKQHHLNCIV-YT-QI 
tgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggactat 

-VEHNQYPTLLVCHETHKDY 
cagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagctg 

QNSKRKAIALN--MHFYRKL 
gagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttca 

ECGMHAYSHTNYHHHSLVSS 
agtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatcaa 

SLTRLLCQTYTQLHWLGNDQ 
cgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataag 

RYNSKTNKHHQ-IYRDV-HK 
aatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaact 

NRRVPLFCKLCHYMAEHRRT 
tccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatgaa 

SILLQPEAHT--PLDFLCHE 
gaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctgc 

ELETYQQAMQTSQPCVVLFC 
aagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacgc 

KQN-PSVHLL--GIQQTNQR 
gcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcca 

A-QSTHGLLNI-S--HHN-P 
catgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgtaa 

HVHFHVPGNVGHGYSEGYP- 
agccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaat 

SPTAEHQS-MGYRHSQNPQN 
gattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacaca 

DSSRHKYLMK-KSKLHVCHT 
gacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaagc 

DNTFFQVQS-QSTSLM-AQS 
catgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagca 

HAPKGRTRLCLTILSVYH-A 
tttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtataa 

FVLS-YALHSRASLYT-VV- 
gatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgttat 

DV-TGHLVEVLH-LW-ILCY 
tttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatccta 

FQCQHNQSVQLLS-HL-KIL 
gctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatcaa 

AGEVG-YPQHL-LHDSPLHQ 
agccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagggt 

SQSTHERDE-LLRG--TY-G 
aaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtca 

NH-LGNSF-NPS-R-VYGRS 
tgtcctttgggtatgcctagg 

CPLGMPR 
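
The six reading frames listed in this figure can be regenerated from the nucleotide sequence with a short sketch. It assumes the standard genetic code (NCBI translation table 1) and prints stop codons as `-`, as the figure does. The function names `translate`, `reverse_complement` and `six_frames` are illustrative and not taken from the source.

```python
# Standard genetic code, bases ordered T, C, A, G (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse (case is preserved)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def translate(seq: str, offset: int = 0, stop: str = "-") -> str:
    """Translate complete codons starting at `offset`; unknown codons give 'X'."""
    seq = seq.upper()
    out = []
    for i in range(offset, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        out.append(stop if aa == "*" else aa)
    return "".join(out)


def six_frames(seq: str) -> dict:
    """Three forward and three reverse-complement frames, keyed like the figure."""
    rc = reverse_complement(seq)
    frames = {}
    for k in range(3):
        frames[f"5'3' Frame {k + 1}"] = translate(seq, k)
        frames[f"3'5' Frame {k + 1}"] = translate(rc, k)
    return frames


# First 60 nucleotides of the sequence shown under 5'3' Frame 1.
first_line = "cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaa"
print(translate(first_line))
```

Called on the first nucleotide line of Frame 1, `translate` returns the 20 residues printed beneath that line in the figure.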
FIGURE 123 



CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAATGAATTACCAAGTCAATGGT 

i N. .N i N 

TACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTCACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCA 

i N N 

TGCAACTAGAGATGCTGTGGGTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 

N 

CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCACCAGGTGACCAGTTTAAACAT 

N N 

CTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAGTGCGTATTAAGATAGTACAAATGCTCAGTGATACACTGAA 

N N N 

AGGATTGTCAGACAGAGTCGTGTTCGTCCTTTGGGCGCATGGCTTTGAGCTTACATCAATGAAGTACTTTGTCAAGATTG 

N N 

GACCTGAAAGAACGTGTTGTCTGTGTGACAAACGTGCAACTTGCTTTTCTACTTCATCAGATACTTATGCCTGCTGGAAT 

N 

CATTCTGTGGGTTTTGACTATGTCTATAACCCATTTATGATTGATGTTCAGCAGTGGGGCTTTACGGGTAACCTTCAGAG 

N N N 

TAACCATGACCAACATTGCCAGGTACATGGAAATGCACATGTGGCTAGTTGTGATGCTATCATGACTAGATGTTTAGCAG 

N N N N N i N 

TCCATGAGTGCTTTGTTAAGCGCGTTGATTGGTCTGTTGAATACCCTATTATAGGAGATGAACTGAGGGTTAATTCTGCT 

. . .N N 

TGCAGAAAAGTACAACACATGGTTGTGAAGTCTGCATTGCTTGCTGATAAGTTTCCAGTTCTTCATGACATAGGAAATCC 

i N 

AAAGGCTATCAAGTGTGTGCCTCAGGCTGAAGTAGAATGGAAGTTCTACGATGCTCAGCCATGTAGTGACAAAGCTTACA 

N N N 

AAATAGAGGAACTCTTCTATTCTTATGCTATACATCACGATAAATTCACTGATGGTGTTTGTTTGTTTTGGAATTGTAAC 

N N. . 

GTTGATCGTTACCCAGCCAATGCAATTGTGTGTAGGTTTGACACAAGAGTCTTGTCAAACTTGAACTTACCAGGCTGTGA 

N N 

TGGTGGTAGTTTGTATGTGAATAAGCATGCATTCCACACTCCAGCTTTCGATAAAAGTGCATTTACTAATTTAAAGCAAT 

N N 

TGCCTTTCTTTTACTATTCTGATAGTCCTTGTGAGTCTCATGGCAAACAAGTAGTGTCGGATATTGATTATGTTCCACTC 

i N 

AAATCTGCTACGTGTATTACACGATGCAATTTAGGTGGTGCTGTTTGCAGACACCATGCAAATGAGTACCGACAGTACTT 

N N N 

GGATGCATATAATATGATGATTTCTGCTGGATTTAGCCTATGGATTTACAAACAATTTGATACTTATAACCTGTGGAATA 

. . N N..N N 

CATTTACCAGGTTACAGAGTTTA SEQ ID NO: 10084 

FIGURE 124 



Sequences : 
Value 



gi|74827|pir||VFIHJH genome polyprotein 1b - murine hepatit.. 
gi|14917044|sp|P29982|RRPB_CVMJH RNA-directed RNA polymeras.. 
gi|26007546|ref|NP_068668.2| ORF1ab polyprotein [Murine hep.. 
gi|7769342|gb|AAF69332.1|AF208066_2 RNA-directed RNA polyme.. 
gi|6625761|gb|AAF19384.1|AF201929_2 RNA-directed RNA polyme.. 
gi|2641128|gb|AAB86818.1| RNA-directed RNA polymerase [muri.. 
gi|4377413|emb|CAA36202.1| open reading frame 1b (AA 1-2733.. 
gi|133592|sp|P16342|RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE 
gi|26008080|ref|NP_150073.2| orf1ab polyprotein [Bovine cor.. 
180 
gi|15077820|gb|AAK83365.1| replicase [bovine coronavirus] 
180 
gi|18033972|gb|AAL57305.1| replicase [bovine coronavirus] 
180 
gi|7769353|gb|AAF69342.1|AF208067_2 RNA-directed RNA polyme.. 
180 
gi|17529672|gb|AAL40397.1|AF220295_2 RNA polymerase 1b [bov.. 
177 
gi|25121571|ref|NP_740618.1| coronavirus nsp11 [Murine hepa.. 
177 
gi|26008092|ref|NP_742140.1| coronavirus nsp11 [Bovine coro.. 
175 
gi|10242469|ref|NP_066134.1| ORF1ab polyprotein; frameshift.. 
163 
gi|14149033|emb|CAC39112.1| replicase polyprotein 1ab [Avia.. 
163 
gi|458735|emb|CAA83018.1| potential chimeric protein [Avian.. 
161 
gi|133594|sp|P26314|RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (.. 
161 
gi|29293454|gb|AAO67706.1| ORF1b polyprotein [Avian infecti.. 
160 
gi|25121555|ref|NP_740631.1| coronavirus nsp11 [Avian infec.. 
158 
gi|9635157|ref|NP_058422.1| replicase [Transmissible gastro.. 
153 
gi|19387582|ref|NP_598309.1| Pol1 [porcine epidemic diarrhe.. 
152 
gi|12175747|ref|NP_073549.1| replicase polyprotein 1ab [Hum.. 
151 
gi|133591|sp|P18458|RRPB_BEV RNA-directed RNA polymerase (0.. 
05 
gi|1513061|dbj|BAA13323.1| cyanoprotein alpha subunit precu.. 



(bits) 

638 
637 
637 
637 
637 
635 
634 
634 
633 
633 
622 
617 
575 
570 
570 
50 
35 



Alignments 



>gi|74827|pir||VFIHJH genome polyprotein 1b - murine hepatitis virus (strain JHM) 
Length = 2731 

Score = 638 bits (1645), Expect = 0.0 
Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHATRDSIGTNFPLQ 1644 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+VVRI+IVQ 
Sbjct: 1645 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1704 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D VV V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWR 1764 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1824 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1882 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVLS 
Sbjct: 1883 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLSK 1939 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1940 LNLPGCNGGSLYVNKHAFHTNPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 1999 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2059 

Query: 486 L 486 
L 
Sbjct: 2060 L 2060 
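
The Identities, Positives and Gaps percentages reported for each hit can be cross-checked against their fractions. The sketch below assumes the percentage is the integer part of 100 x count / length; that assumption is consistent with the values in this figure (for example 364/481 is printed as 75%) but is not stated in the listing. The names `STAT` and `check_stats` are hypothetical.

```python
import re

# Matches e.g. "Identities = 287/481 (59%)" in a BLAST statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def check_stats(line: str) -> dict:
    """Map each statistic to (count, length, printed %, recomputed integer %)."""
    result = {}
    for name, count, length, printed in STAT.findall(line):
        count, length, printed = int(count), int(length), int(printed)
        result[name] = (count, length, printed, (100 * count) // length)
    return result


line = "Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%)"
for name, (count, length, printed, recomputed) in check_stats(line).items():
    print(name, count, length, printed, recomputed)
```

A mismatch between the printed and recomputed percentage on any hit would point to a transcription error in that line.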



>gi|14917044|sp|P29982|RRPB_CVMJH RNA-directed RNA polymerase (ORF1B) 
gi|7583321|gb|AAA46458.2| open reading frame 1b [murine hepatitis virus] 
Length = 2731 

Score = 637 bits (1644), Expect = 0.0 
Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHATRDSIGTNFPLQ 1644 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+VVRI+IVQ 
Sbjct: 1645 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1704 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D VV V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWR 1764 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1824 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1882 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVLS 
Sbjct: 1883 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLSK 1939 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1940 LNLPGCNGGSLYVNKHAFHTNPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 1999 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2059 

Query: 486 L 486 
L 
Sbjct: 2060 L 2060 



>gi|26007546|ref|NP_068668.2| ORF1ab polyprotein [Murine hepatitis virus] 
Length = 7178 

Score = 637 bits (1644), Expect = 0.0 
Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 6032 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 6091 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+VVRI+IVQ 
Sbjct: 6092 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 6151 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D VV V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 6152 MLSDHLADLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR 6211 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 6212 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 6271 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 6272 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 6329 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 
Sbjct: 6330 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK 6386 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 6387 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 6446 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 6447 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 6506 

Query: 486 L 486 
L 
Sbjct: 6507 L 6507 



>gi|7769342|gb|AAF69332.1|AF208066_2 RNA-directed RNA polymerase [murine hepatitis virus] 
Length = 2732 

Score = 637 bits (1644), Expect = 0.0 
Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1586 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIRRVRAWVGFDAEGAHATRDSIGTNFPLQ 1645 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+VVRI+IVQ 
Sbjct: 1646 LGFSTGIDFVVEATGMFAERDGYVFKKAVARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1705 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D VV V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1706 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWR 1765 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1766 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDLICSVHKGAHVASSDAIMTRCLAVHDCFCK 1825 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1826 SVNWSLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1883 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 
Sbjct: 1884 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK 1940 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1941 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2000 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2001 RSATCITRCNLGGAVCLKHAEDYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2060 

Query: 486 L 486 
L 
Sbjct: 2061 L 2061 



>gi|6625761|gb|AAF19384.1|AF201929_2 RNA-directed RNA polymerase [murine hepatitis virus strain 2] 
gi|7739595|gb|AAF68920.1|AF207902_2 RNA-directed RNA polymerase [murine hepatitis virus strain ML-11] 
Length = 2733 

Score = 637 bits (1643), Expect = 0.0 
Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIRRVRAWVGFDAEGAHATRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+VVRI+IVQ 
Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAVARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D VV V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWR 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDLICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWSLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 
Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2001 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEDYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2061 

Query: 486 L 486 
L 
Sbjct: 2062 L 2062 



>gi I 2641128 | gb | AAB86818.il RNA-directed RNA polymerase [murine hepatitis 
virus] 

Length = 2733 
Score = 635 bits (1637), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MT YRRL I SMMGFKMNYQVNGY PNMFI TREEAI RHVRAWI GFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 
1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+VVRI+IVQ

Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ
1706 
Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185

MLSD L L+D VV V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLADLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR
1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK
1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++

Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF
1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+

Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK

1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL
2001 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS
2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi|4377413|emb|CAA36202.1| open reading frame 1b (AA 1-2733) [Murine
hepatitis virus] 

Length = 2733 

Score = 634 bits (1636), Expect = 0.0

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 
1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+VVRI+IVQ

Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ
1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185
MLSD L L+D VV V WA FELT ++YF K+G E C +C KRATCF++ + Y CW
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR
1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK
1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF
1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+

Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK

1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 
2001 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS
2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi|133592|sp|P16342|RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE (ORF1B)

gi|93916|pir||S15760 genome polyprotein - murine hepatitis virus (strain
A59) 

Length = 2733 
Score = 634 bits (1636), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 
1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+VVRI+IVQ 

Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 
1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D VV V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR
1766 
Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK
1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF
1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+

Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK

1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL
2001 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS
2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 

>gi|26008080|ref|NP_150073.2| orf1ab polyprotein [Bovine coronavirus]
Length = 7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 
6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+VVR +IVQ 

Sbjct: 6008 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDVVRPRIVQ 
6067 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185

M +D L LSD VV V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 
6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN
6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++
Sbjct: 6188 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV--KDFDF
6245 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct: 6246 KFYDAQPI---VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVLNN

6302 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL
6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS
Sbjct: 6363 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQS
6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi|15077820|gb|AAK83365.1| replicase [bovine coronavirus]
Length = 7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ
6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+VVR +IVQ 

Sbjct: 6008 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDVVRPRIVQ 
6067 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD VV V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR
6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN
6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305
++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++
Sbjct: 6188 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV--KDFDF
6245 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N

Sbjct: 6246 KFYDAQPI---VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVLNN

6302 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL
Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL 
6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS
Sbjct: 6363 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQS
6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi|18033972|gb|AAL57305.1| replicase [bovine coronavirus]
Length = 7094

Score = 633 bits (1633), Expect = e-180

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ
6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+VVR +IVQ 

Sbjct: 6008 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDVVRPRIVQ
6067 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185

M +D L LSD VV V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW
Sbjct: 6068 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR
6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 
6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 6188 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV--KDFDF
6245 
Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N

Sbjct: 6246 KFYDAQPI---VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVLNN

6302 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL
Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL
6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS
Sbjct: 6363 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQS
6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi|7769353|gb|AAF69342.1|AF208067_2 RNA-directed RNA polymerase [murine
hepatitis virus] 

Length = 2733 

Score = 633 bits (1633), Expect = e-180 

Identities = 285/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

++Y RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VSYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ
1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+VVRI+IVQ

Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ
1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D VV V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR
1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK
1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF
1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+
Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK
1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 
2001 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 
2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi|17529672|gb|AAL40397.1|AF220295_2 RNA polymerase 1b [bovine
coronavirus] 

Length = 2685 

Score = 623 bits (1607), Expect = e-177

Identities = 282/481 (58%), Positives = 365/481 (75%), Gaps = 5/481 (1%)

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1574 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 
1633 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+VVR +IVQ 

Sbjct: 1634 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDVVRPRIVQ 
1693 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD VV V WA FELT ++YF K+G E +C + KRAT +++ + Y CW 
Sbjct: 1694 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVSTKRATAYNSRTGYYGCWR
1753 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF
Sbjct: 1754 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN
1813 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 1814 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV--KDFDF
1871 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365
KFYDAQP ++ L Y + H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N

Sbjct: 1872 KFYDAQPI---VKSVKTLLYFFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVLNN

1928 
Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 1929 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL
1988 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 1989 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQS
2048 

Query: 486 L 486 
L 

Sbjct: 2049 L 2049 



>gi|25121571|ref|NP_740618.1| coronavirus nsp11 [Murine hepatitis virus]
Length = 521 

Score = 622 bits (1603), Expect = e-177 

Identities = 284/479 (59%), Positives = 362/479 (75%), Gaps = 5/479 (1%)

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 107

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+VVRI+IVQ 

Sbjct: 108 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 167 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D VV V WA FELT ++YF K+G E C +C KRATCF++ + Y CW
Sbjct: 168 MLSDHLADLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 228 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 287 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 288 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 345 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 346 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK 402

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 403 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 462 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQ 
Sbjct: 463 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQ 521



>gi|26008092|ref|NP_742140.1| coronavirus nsp11 [Bovine coronavirus]
Length = 521
Score = 617 bits (1590), Expect = e-175 

Identities = 282/479 (58%), Positives = 365/479 (76%), Gaps = 5/479 (1%) 



Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 107 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+VVR +IVQ

Sbjct: 108 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDVVRPRIVQ 167

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD VV V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW
Sbjct: 168 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 228 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 287 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++
Sbjct: 288 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV--KDFDF 345

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N

Sbjct: 346 KFYDAQPI---VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVLNN 402

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 403 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL 462

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQ
Sbjct: 463 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQ 521



>gi|10242469|ref|NP_066134.1| ORF1ab polyprotein; frameshift product
[Avian infectious bronchitis 
virus] 
Length = 6629 

Score = 575 bits (1482), Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 5515 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 
5574 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 5575 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV
5634 






Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD VVFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 5635 QMLADNLCNVSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 
5693 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 5694 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAINNAFC 
5753 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN 
5813 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS

Sbjct: 5814 FRFYDKNPIVRNVKQFE---YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS

5870 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 5871 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 
5929 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 LATKDCITKCNIGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ
5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 



>gi|14149033|emb|CAC39112.1| replicase polyprotein 1ab [Avian infectious
bronchitis virus (strain
Beaudette CK)]
Length = 6629 

Score = 575 bits (1482), Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP
Sbjct: 5515 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF
5574 



Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 5575 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV
5634 



Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD VVFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW
Sbjct: 5635 QMLADNLCNVSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW
5693 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 5694 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAINNAFC
5753 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V
Sbjct: 5754 QDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN
5813 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS

Sbjct: 5814 FRFYDKNPIVRNVKQFE---YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS

5870 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V

Sbjct: 5871 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS
5929 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 LATKDCITKCNIGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ
5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 



>gi|458735|emb|CAA83018.1| potential chimeric protein [Avian infectious
bronchitis virus]

Length = 2155

Score = 570 bits (1470), Expect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1596 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF
1655 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 1656 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV
1715 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD VVFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1716 QMLADNLCNVSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 
1774 
Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 1775 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAINNAFC 
1834 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1835 QDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN 
1894 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 1895 FRFYDKNPIVRNVKQFE---YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS

1951 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V

Sbjct: 1952 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 
2010 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 2011 LATKDCITKCNIGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ
2070 

Query: 485 SL 486 
S+ 

Sbjct: 2071 SI 2072 



>gi|133594|sp|P26314|RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (ORF1B)

gi|74826|pir||VFIHB2 genome polyprotein - avian infectious bronchitis
virus (strain

Beaudette)

gi|292953|gb|AAA70234.1| pol protein [Avian infectious bronchitis virus]
gi|331173|gb|AAA46224.1| ORF1b [Avian infectious bronchitis virus]
Length = 2652

Score = 570 bits (1469), Expect = e-161

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%)

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF
1597 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 1598 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV
1657 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD VVFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADNLCNVSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW
1716 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 1717 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAINNAFC 
1776 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V
Sbjct: 1777 QDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN
1836 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS

Sbjct: 1837 FRFYDKNPIVRNVKQFE---YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS

1893 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 1894 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS
1952 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 1953 LATKDCITKCNIGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ
2012 

Query: 485 SL 486 
S+ 

Sbjct: 2013 SI 2014 



>gi|29293454|gb|AAO67706.1| ORF1b polyprotein [Avian infectious bronchitis
virus] 

Length = 2649 
Score = 565 bits (1455), Expect = e-160 

Identities = 261/482 (54%), Positives = 342/482 (70%), Gaps = 8/482 (1%)

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 
1597 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 

Q+GFSTG + V P G +DT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 1598 QVGFSTGADFVVTPEGLIDTSIGNNFEPVNSKAPPGEQFNHLRALFKSAKPWHVIRPRIV 
1657 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD VVFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADNLCNVSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 
1716 
Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244
H +G VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DA+MTRCLA++ F

Sbjct: 1717 RHCLG---VYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASADAVMTRCLAINNAFC

1773 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304

K V+W ++YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1774 KDVNWELQYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN 
1833 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y+ H DKF DG+C+FWNCNVD YP N++VCR+DTR LS

Sbjct: 1834 FRFYDKNPIVPNVKQFE---YDYSQHKDKFADGLCMFWNCNVDCYPENSLVCRYDTRNLS

1890 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVP 424

NLPGC+GGSLYVNKHAFHTP FD+ +F NLK +PFF+Y SPCE+ V+ D V

Sbjct: 1891 VFNLPGCNGGSLYVNKHAFHTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS
1949 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ F+ YNLW F+ LQ 
Sbjct: 1950 LATKDCITKCNIGGAVCKKHAQMYAEFVFSYNAAVTAGFTFWVTNNFNPYNLWKNFSALQ
2009 

Query: 485 SL 486 
S+ 

Sbjct: 2010 SI 2011 



>gi|25121555|ref|NP_740631.1| coronavirus nsp11 [Avian infectious
bronchitis virus] 

Length = 521 

Score = 559 bits (1440), Expect = e-158 

Identities = 261/480 (54%), Positives = 342/480 (71%), Gaps = 5/480 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP
Sbjct: 47 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 106 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 107 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 166 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD VVFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 167 QMLADNLCNVSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 225

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 226 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAINNAFC 285 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304
+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V
Sbjct: 286 QDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN 345

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 346 FRFYDKNPIVRNVKQFE---YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 402

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 403 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 461

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 

L + CIT+CN+GGAVC+ HA Y + + + +YN ++AGF+ W+ + + YNLW + F+ LQ 
Sbjct: 462 LATKDCITKCNIGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ 521
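
The "Identities / Positives / Gaps" summary lines of the hits in this listing can be read mechanically. The following is a minimal sketch, not part of the original report; the function names `parse_hsp_summary` and `truncated_percent` are hypothetical. It assumes the percentages are truncated rather than rounded, which is what the values printed in this listing suggest (for example, 334/480 is 69.58% but is printed as 69%).

```python
import re

# Matches e.g. "Identities = 261/480 (54%)"
_FIELD = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_hsp_summary(line):
    """Return {field: (count, alignment_length, printed_percent)} for one summary line."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in _FIELD.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage with the fractional part dropped (assumed report behaviour)."""
    return (100 * count) // length


if __name__ == "__main__":
    line = "Identities = 261/480 (54%), Positives = 342/480 (71%), Gaps = 5/480 (1%)"
    for name, (count, length, printed) in parse_hsp_summary(line).items():
        print(name, count, length, printed, truncated_percent(count, length))
```

The same parser can be applied to each hit's summary line to cross-check the printed percentages against the raw counts.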



>gi|9635157|ref|NP_058422.1| replicase [Transmissible gastroenteritis virus]
gi|7801348|emb|CAB91143.1| replicase [Transmissible gastroenteritis virus]

Length = 6685 
Score = 545 bits (1403), Expect = e-153 

Identities = 261/484 (53%), Positives = 335/484 (69%), Gaps = 13/484 (2%)

Query: 4 KDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 63 

KD+ Y +IS MGF+ + GY +F TR+ A+R+VRAW+GFDVEG H D VGTN+P 
Sbjct: 5574 KDVKYANVISYMGFRFEANIPGYHTLFCTRDFAMRNVRAWLGFDVEGAHVCGDNVGTNVP 5633

Query: 64 LQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKI 123 

LQLGFS GV+ V G V TE V A+ PPG+QF HLIPLM KG PW++VR +I

Sbjct: 5634 LQLGFSNGVDFVVQTEGCVITEKGNSIEVVKARAPPGEQFAHLIPLMRKGQPWHIVRRRI 5693

Query: 124 VQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIG-PERTCCLCDKRATCFSTSSDTYA 182 

VQM+ D GLSD ++FVLWA G ELT+M+YFVKIG P++ C C K ATC+S+S YA 
Sbjct: 5694 VQMVCDYFDGLSDILIFVLWAGGLELTTMRYFVKIGRPQK--CECGKSATCYSSSQSVYA 5751

Query: 183 CWNHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHEC 242 

C+ H++G DY+YNP+ ID+QQWG+TG+L NH + C +H N HVAS DAIMTRCLA+H+C 
Sbjct: 5752 CFKHALGCDYLYNPYCIDIQQWGYTGSLSMNHHEVCNIHRNEHVASGDAIMTRCLAIHDC 5811

Query: 243 FVKRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAE 302

FVKRVDWS+ YP I +E ++N A R VQ V+K+AL +HD+GNPK I+C 

Sbjct: 5812 FVKRVDWSIVYPFIDNEEKINKAGRIVQSHVMKAALKIFNPAAIHDVGNPKGIRCA-TTP 5870

Query: 303 VEWKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRV 362 
+ W YD P ++ + L Y Y +H +G+ LFWNCNVD YP +IVCRFDTR 

Sbjct: 5871 IPWFCYDRDPINN---NVRCLDYDYMVHGQ--MNGLMLFWNCNVDMYPEFSIVCRFDTRT 5925

Query: 363 LSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDY 422 
S L+L GC+GG+LYVN HAFHTPA+D+ AF LK +PFFYY DS CE V +Y

Sbjct: 5926 RSKLSLEGCNGGALYVNNHAFHTPAYDRRAFAKLKPMPFFYYDDSNCE----LVDGQPNY 5981

Query: 423 VPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTR 482

VPLKS CIT+CN+GGAVC+ HA YR Y++ YN+ + AGF++W + FDTY LW+ F 
Sbjct: 5982 VPLKSNVCITKCNIGGAVCKKHAALYRAYVEDYNIFMQAGFTIWCPQNFDTYMLWHGFVN 6041

Query: 483 LQSL 486 
++L 

Sbjct: 6042 SKAL 6045 



>gi|19387582|ref|NP_598309.1| Pol1 [porcine epidemic diarrhea virus]
gi|13752450|gb|AAK38661.1| Pol1 [porcine epidemic diarrhea virus]
Length = 6781 

Score = 541 bits (1394), Expect = e-152 

Identities = 256/480 (53%), Positives = 334/480 (69%), Gaps = 12/480 (2%) 

Query: 8 YRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLG 67 

Y + IS MGF+ + + + +F TR+ A+R+VR W+GFDVEG H VGTN+PLQLG
Sbjct: 5675 YEHVISFMGFRFDINIPNHHTLFCTRDFAMRNVRGWLGFDVEGAHVVGSNVGTNVPLQLG 5734

Query: 68 FSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQML 127 

FS GV+ V P G V TE+ V A+ PPG+QF HL+PL+ +G PW+VVR +IVQM 

Sbjct: 5735 FSNGVDFVVRPEGCVVTESGDYIKPVRARAPPGEQFAHLLPLLKRGQPWDVVRKRIVQMC 5794

Query: 128 SDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNHS 187 

SD L LSD + + FVLWA G ELT+M+YFVKIGP ++C C K ATC+ + + + TY C+ H+
Sbjct: 5795 SDYLANLSDILIFVLWAGGLELTTMRYFVKIGPSKSCD-CGKVATCYNSALHTYCCFKHA 5853

Query: 188 VGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRV 247 

+G DY+YNP+ ID+QQWG+ G+L NH +HC VH N HVAS DAIMTRCLA+H+CFVK V 
Sbjct: 5854 LGCDYLYNPYCIDIQQWGYKGSLSLNHHEHCNVHRNEHVASGDAIMTRCLAIHDCFVKNV 5913

Query: 248 DWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEWKF 307

DWS+ YP IG+E +N + R VQ ++S L ++DIGNPK I+C + +W 

Sbjct: 5914 DWSITYPFIGNEAVINKSGRIVQSHTMRSVLKLYNPKAIYDIGNPKGIRCA-VTDAKWFC 5972

Query: 308 YDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSNLN 367
+D P + +E Y Y I H +F DG+CLFWNCNVD YP ++VCRFDTR S LN 

Sbjct: 5973 FDKNPTNSNVKTLE---YDY-ITHGQF-DGLCLFWNCNVDMYPEFSVVCRFDTRCRSPLN 6027

Query: 368 LPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKS 427 
L GC+GGSLYVN HAFHTPAFDK AF LK +PFF+Y D+ C+ ++ I+YVPL++ 

Sbjct: 6028 LEGCNGGSLYVNNHAFHTPAFDKRAFAKLKPMPFFFYDDTECD----KLQDSINYVPLRA 6083
Query: 428 ATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFT-RLQSL 486

+ CIT+CN+GGAVC H Y Y++AYN SAGF++W+ FDTYNLW TF+ LQ L 
Sbjct: 6084 SNCITKCNVGGAVCSKHCAMYHSYVNAYNTFTSAGFTIWVPTSFDTYNLWQTFSNNLQGL 6143



>gi|12175747|ref|NP_073549.1| replicase polyprotein 1ab [Human coronavirus 229E]
gi|12082740|gb|AAG48591.1|AF304460_2 replicase polyprotein 1ab [Human coronavirus 229E]

Length = 6758 

Score = 535 bits (1379), Expect = e-151 

Identities = 254/478 (53%), Positives = 329/478 (68%), Gaps = 13/478 (2%)

Query: 7 TYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQL 66 

TY +IS MGF+ + + G ++F TR+ A+RHVR W+G DVEG H T D VGTN+PLQ+ 
Sbjct: 5642 TYEHVISYMGFRFDVSMPGSHSLFCTRDFAMRHVRGWLGMDVEGAHVTGDNVGTNVPLQV 5701

Query: 67 GFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQM 126

GFS GV+ VA P G V T + V A+ PPG+QF H++PL+ KG PW+V+R +IVQM
Sbjct: 5702 GFSNGVDFVAQPEGCVLTNTGSVVKPVRARAPPGEQFTHIVPLLRKGQPWSVLRKRIVQM 5761

Query: 127 LSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNH 186

++D L G SD +VFVLWA G ELT+M+YFVKIG + C C ATC+++ S+ Y C+ H 
Sbjct: 5762 IADFLAGSSDVLVFVLWAGGLELTTMRYFVKIGAVKH-CQCGTVATCYNSVSNDYCCFKH 5820

Query: 187 SVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKR 246

++G DYVYNP++ID+QQWG+ G+L +NH C VH N HVAS DAIMTRCLAV++CFVK
Sbjct: 5821 ALGCDYVYNPYVIDIQQWGYVGSLSTNHHAICNVHRNEHVASGDAIMTRCLAVYDCFVKN 5880

Query: 247 VDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEWK 306 

VDWS+ YP+I +E +N R VQ ++++A+ +HDIGNPK I+C + +W 

Sbjct: 5881 VDWSITYPMIANENAINKGGRTVQSHIMRAAIKLYNPKAIHDIGNPKGIRCA-VTDAKWY 5939

Query: 307 FYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSNL 366 
YD P + +E Y Y H DG+CLFWNCNVD YP +IVCRFDTR S L 

Sbjct: 5940 CYDKNPINSNVKTLE---YDYMTHGQ--MDGLCLFWNCNVDMYPEFSIVCRFDTRTRSTL 5994

Query: 367 NLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSD-IDYVPL 425 
NL G +GGSLYVN HAFHTPA+DK A LK PFFYY D CE WD ++YVPL 

Sbjct: 5995 NLEGVNGGSLYVNNHAFHTPAYDKRAMAKLKPAPFFYYDDGSCE WHDQVNYVPL 6049

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRL 483

++ CIT+CN+GGAVC HAN YR Y+++YN+ AGF++W+ FD YNLW TFT + 
Sbjct: 6050 RATNCITKCNIGGAVCSKHANLYRAYVESYNIFTQAGFNIWVPTTFDCYNLWQTFTEV 6107 
>gi|133591|sp|P18458|RRPB_BEV RNA-directed RNA polymerase (ORF1B)
gi|94017|pir||S11238 polymerase - Berne virus

gi|1334814|emb|CAA36601.1| 2nd polymerase reading frame (AA 1-2291) 
[Berne virus] 

Length = 2291 

Score = 50.1 bits (118), Expect = 8e-05 

Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 11/103 (10%) 

Query: 140 FVLWAHGFELTSMKYFVKIGPERTC--CLCDKRATCFSTSSDTYACWNHSVGF--DYVYN 195
F+L++ +L S+K++V+ TCCC+AC +YCN G + N 

Sbjct: 1511 FILYSCSNDLKSLKFYVEFD---TCYFCSCGEMAICLMRDGN-YKCRNCYGGMLISKLVN 1566

Query: 196 PFMIDVQQWGFTGNLQSNHDQHC-QVHGNAHVASCDAIMTRCL 237 

+DVQ+ LQ HD C Q HG++H A CDA+MT+CL 

Sbjct: 1567 CKYLDVQKERV--KLQDAHDAICQQFHGDSHEALCDAVMTKCL 1607 
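
Because the alignment rows in this listing lost gap characters and picked up stray spaces during text extraction, a row can be sanity-checked against its own coordinates: the number of residue letters must equal `end - start + 1`. The sketch below is an illustration only; `check_alignment_row` is a hypothetical helper, and it assumes a row of the form `Label: start SEQUENCE end` with `-` as the gap character.

```python
def check_alignment_row(row):
    """Return (expected, observed) residue counts for one 'Query:'/'Sbjct:' row.

    expected is derived from the printed coordinates, observed from the letters
    actually present in the sequence field (gap characters are ignored).
    """
    _, rest = row.split(":", 1)
    fields = rest.split()
    start, sequence, end = int(fields[0]), fields[1], int(fields[2])
    observed = sum(1 for ch in sequence if ch.isalpha())
    expected = end - start + 1
    return expected, observed


if __name__ == "__main__":
    rows = [
        "Query: 196 PFMIDVQQWGFTGNLQSNHDQHC-QVHGNAHVASCDAIMTRCL 237",
        "Sbjct: 1567 CKYLDVQKERV--KLQDAHDAICQQFHGDSHEALCDAVMTKCL 1607",
    ]
    for row in rows:
        expected, observed = check_alignment_row(row)
        print(row.split()[0], expected, observed, "ok" if expected == observed else "MISMATCH")
```

A mismatch usually means the sequence field was split by a stray space or that gap characters were dropped, so the row needs to be compared with the query row above it.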



>gi|1513061|dbj|BAA13323.1| cyanoprotein alpha subunit precursor [Riptortus clavatus]

Length = 693 

Score = 34.7 bits (78), Expect = 3.7 

Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 1/36 (2%) 

Query: 371 CDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSD 406

C G LY + KHA P FD+ A+ + Q+P FY+ D 
Sbjct: 643 CGGSKLYDSKHAMGFP-FDRPAYPDAFQVPNFYFKD 677



Database: All non-redundant GenBank CDS
translations + PDB + SwissProt + PIR + PRF

Posted date: Apr 11, 2003 2:30 AM 
Number of letters in database: 454,141,287 
Number of sequences in database: 1,411,415 

Lambda K H 

0.325 0.139 0.456 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 473,361,261 

Number of Sequences: 1411415 

Number of extensions: 20503315 

Number of successful extensions: 51018 

Number of sequences better than 10.0: 27 

Number of HSP's better than 10.0 without gapping: 26

Number of HSP's successfully gapped in prelim test: 1 
Number of HSP's that attempted gapping in prelim test: 50937

Number of HSP's gapped (non-prelim): 33 

length of query: 486 

length of database: 454,141,287 

effective HSP length: 127 

effective length of query: 359 

effective length of database: 274,891,582 

effective search space: 98686077938 

effective search space used: 98686077938 
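
The search statistics above are internally consistent and can be re-derived from the printed values. The following is a hedged sketch rather than the report's own code: it assumes the usual Karlin-Altschul relations, namely bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), and it uses the lambda and K printed under "Gapped". All function names are hypothetical.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LENGTH = 486
DATABASE_LETTERS = 454_141_287
DATABASE_SEQUENCES = 1_411_415
EFFECTIVE_HSP_LENGTH = 127
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_sequences, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_sequences * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (assumed Karlin-Altschul form)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_query, eff_db, space = effective_search_space(
        QUERY_LENGTH, DATABASE_LETTERS, DATABASE_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print(eff_query, eff_db, space)
    for raw in (1440, 118, 78):  # raw scores printed in parentheses in the hits
        bits = bit_score(raw)
        print(raw, round(bits, 1), f"{expect_value(bits, space):.1e}")
```

With these inputs the effective lengths and search space reproduce the figures printed above, and the raw score of 118 for the Berne virus hit comes out at roughly 50.1 bits with an expect value near 8e-05. Small differences from the printed expect values are to be expected because the report rounds lambda, K, and the bit scores.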
FIGURE 125A
FIGURE 126

5' 3' Frame 1 

QVHQNVCVL-LIFYLMTLSR--SHKICQ-FQKWSRLQLTMLKFHSCFGVRMDMLKPSTQN

YKQVKRGNQVLRCLTCTRCKECFLKSVTFRIMVKMLLYQKE---MSQSILNCVNT-IHLL

-LYPPT-ELFTLVL 

5' 3' Frame 2 

RFIKMCVFCD-SFT--LCRDNKVTRFVSDFKSGQGYN-LC-NFIHALV-GWTC-NLLPKT 
TSKSSVATRCCDA-LVQDAKNAS-KV-PSELW-KCCYTKRNNDECRKVYSTVSILKYTYF 
SCTLQHESYSLWCW 

5' 3' Frame 3

GSSKCVCSVIDLLLDDFVEIIKSQDLSVISKVVKVTIDYAEISFMLWCKDGHVETFYPKL 
QASQAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLNTLTL 
AVPSNMRVIHFGAG 

3' 5' Frame 1 

PAPK-ITLMLEGTAKVSVFKY-HS-VYFATFIIIPFGITAFSP-F-RSHFSRSILCILYK 
LGIATPGCHA-LACSFG-KVSTCPSLHQSMNEISA-SIVTLTTFEITDKSCDFIISTKSS 
SKRSITEHTHFDEP 

3' 5' Frame 2 

QHQSE-LSCWRVQLK-VYLSIDTVEYTLRHSSLFLLV-QHFHHNSEGHTFQEAFFASCTS 
-ASQHLVATLDLLVVLGRRFQHVHPYTKA-MKFQHSQL-P-PLLKSLTNLVTLLSRQSHQ 
VKDQSQNTHILMNL 

3' 5' Frame 3 

STKVNNSHVGGYS-SKCI-VLTQLSILCDIHHYSFWYNSIFTIILKVTLFKKHSLHLVQV 
RHRNTWLPRLTCL-FWVEGFNMSILTPKHE-NFSIVNCNLDHF-NH-QIL-LYYLDKVIK 
-KINHRTHTF--T
FIGURE 127

5' 3' Frame 1 

-VFTYPGKANQPRSLVDLFSKRTN-NV--WTPIKPT-CPPHYIWWTHRFN-Q-PEWRTAM 
GQGQNSADPKVYPIILRLGSQLSLSMARRNLDSLEARAFQSTPIVVQMTKLATTEELPDE 
FVVVTAK-KSSAPDGTSIT-ELAQKLHFPTALTKKASYGLQLREP-IHPKTTLAPAILIT 
MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 
VVKLPSRYCC-TD-TSLRAKFLVKANNNKAKLSLRNLLLRHLKSLAKNVLPQNSTTSLKH 
LGDVVQNKPKEISGTKT-SDKELITNIGPQIAQFA 

5' 3' Frame 2 

RFLPTQEKPTNLDLL-ICSLNEQIKMSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGLQW 
GKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ-WSR-PNWLLPKSYPTS 
SWW-RQNERAQPQMVLLLPRNWPRSFTSLRR-QRRHRMGCN-GSLEYTQRPHWHPQS--Q
CCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG-RR 
W-NCPRAIAARQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-KASPKTYCHKTVQRHSSI 
WETWSRTNPRKFRGPRPNQTRN-LQTLGRKLHNLP 

5' 3' Frame 3 

GFYLPRKSQPTSISCRSVL-TNKLKCLIMDPNQTNVVPPALHLVDPQIQLTITRMEDCNG 
ARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRV 
RGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNN 
AATVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGG 
GETALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAF 
GRRGPEQTQGNFGDQDLIRQGTDYKHWAANCTIC 

3' 5' Frame 1 

RQIVQFAAQCL-SVPCLIRSWSPKFPWVCSGPRLPNA-VTLYCFVAVRFWRGFLDASAAD 
FLVTVWPCCCWPLPETLLSSWFNLSSSNSARAVSPPPLAIRAGEFPLLLPGVEFLELPRL 
RDEEREEA-LPPLLPSA-KPFGNVVP-GSCSTVAALLLGLRVPMWSLGVFKAPSVATHTM 
PSLLAP-GSEASGPVPR--KYHLGLSSFILPSPPRTRRVALR--PIWSSGPLLVLIGTPW
PRGNLSSSLPC-VRAVNQDAILLGKPWGRRCFGLAPLQSSILVIVS-ICGSTKCNAGGTT 
LV-LGSIIRHFNLFV-RTDLQEIEVGWLFLGR-KP 

3' 5' Frame 2 

GKLCNLRPNVCNQFLV-LGLGPRNFLGFVLDHVSQMLE-RCTVLWQYVFGEAF-MPQQQI 
S--QFGLVVVGLYQKLCSQAGSICLAAIARGQFHHLR-PFEQENFPYCCQELNFLNYRDY
VMRSEKRLDCRLCFPLRRSLLAMLFLEEVVARWQHCY-DCGCQCGLWVYSRLPQLQPIRC 
LLC-RRREVKLLGQFLGNRSTIWG-ALSFCRHHHELVG-LFGSSQFGHLDHYWC-LERPG 
LEGI-VPPCHAE-EL-TKTQYYWVNLGVGAVLALPHCSPPFWLLSVESVGPPNVMRGALR 
WFDWGPLSDILICSFREQIYKRSRLVGFSWVGKNL 

3' 5' Frame 3 

ANCAICGPMFVISSLSD-VLVPEISLGLFWTTSPKCLSDVVLFCGSTFLARLFRCLSSRF 
LSDSLALLLLAFTRNFALKLVQSV-QQ-REGSFTTSASHSSRRISPTAARS-IS-ITATT 
--GARRGLTAASASLCVEAFWQCCSLRKL-HGGSIVIRIAGANVVFGCIQGSLSCNPYDA
FFVSAVGK-SFWASS-VIEVPSGAELFHFAVTTTNSSGSSSVVANLVIWTTIGVDWNALA 
SRESKFLLAMLSESCEPRRNIIG-TLGSALFWPCPIAVLHSGYCQLNLWVHQM-CGGHYV 
GLIGVHYQTF-FVRLENRSTRDRGWLAFPG-VKT 



FIGURE 128 

-GLELKL-LTSICAF-PFCYSLF--CLLYFGFHSKSRI-KNLVPKSKRT-NFSLF-LVFL
YAVAYAL-YSAVHLINLMCLKILVRYNTRGNTYSTAWLCALGKVLPFHRWHTMVQTCTPN 
VTINCQDPAGGALIARCWYLHEGHQTAAFRDVLVVLNKRTN-NV--WTPIKPT-CPPHYI
WWTHRFN-Q-PEWRTQWGKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ 
-WSR-PNWLLPKSYPTSSWW-RQNERAQPQMVLLLPRNWPRSFTSLRR-QRRHRMGCN-G 
SLEYTQRPHWHPQS--QCCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKF
NSWQQ-GKFSCSNG-RRW-NCPRAIAARQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-K 
ASPKTYCHKTVQRHSSIWETWSRTNPRKFRGPRPNQTRN-LQTLAANCTICSKCLCILWN 
VTHWHGSHTFGNMADLSWSH-IG-QRSTIQRQRHTAEQAH-RIQNIPTNRA-KGQKEKD- 
-SSAFAAETKEAAHCDSSSC 

EDSSSSFN-LLFVLFSLSAIPCFNNAYYILVFTRNPGSRRTLYQSLNEHETSHCFDLYFS 
MQLHMHCSTALCI--TSCA-RSL-GTTLGVILIALLGFVL-ERFYLFIDGTLWFKHAHLM
LLSTVKIQLVVRL-LGVGTFMKVTKLLHLETYLLF-INEQIKMSDNGPQSNQRSAPRITF 
GGPTDSTDNNQNGGRNGARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTN 
SGPDDQIGYYRRATRRVRGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEG 
ALNTPKDHIGTRNPNNNAATVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNS 
TPGSSRGNSPARMASGGGETALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKK 
PRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGM 
SRIGMEVTPSGTWLTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDKKKKTD 
EAQPLPQRQKKQPTVTLLP 
RTRAQALIDFYLCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV-TNMKLLIVLTCISL
CSCICTVVQRCASNKPHVLEDPCKVQH-G-YL-HCLALCSRKGFTFS-MAHYGSNMHT-C 
YYQLSRSSWWCAYS-VLVPS-RSPNCCI-RRTCCFK-TNKLKCLIMDPNQTNVVPPALHL 
VDPQIQLTITRMEDAMGQGQNSADPKVYPIILRLGSQLSLSMARRNLDSLEARAFQSTPI 
VVQMTKLATTEELPDEFVVVTAK-KSSAPDGTSIT-ELAQKLHFPTALTKKASYGLQLRE 
P-IHPKTTLAPAILITMLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQ 
LLAAVGEILLLEWLAEVVKLPSRYCC-TD-TSLRAKFLVKANNNKAKLSLRNLLLRHLKS 
LAKNVLPQNSTTSLKHLGDVVQNKPKEISGTKT-SDKELITNIGRKLHNLLQVPLHSLEC 
HALAWKSHLREHG-LIMEPLNWMTKIHNSKTTSYC-TSTLTHTKHSHQQSLKRTKRKRLM 
KLSLCRRDKRSSPL-LFFL 



FIGURE 129 

5' 3' Frame 1

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccct 

YRRLISMMGFKMNYQVNGYP
aatatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgat 

NMFITREEAIRHVRAWIGFD
gtagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctagga 

VEGCHATRDAVGTNLPLQLG 
ttttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataac 

FSTGVNLVAVPTGYVDTENN 
acagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

TEFTRVNAKPPPGDQFKHLI 



5' 3' Frame 2 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttacccta 

TVDSSL-WVSK-ITKSMVTL 
atatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatg 

ICLSPAKKLFVTFVRGLALM 
tagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggat 

-RAVMQLEMLWVLTYLSS-D 
tttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataaca 

FLQVLT--LYRLVMLTLKIT 
cagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

QNSPELMQNLHQVTSLNILY 
5' 3' Frame 3

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccctaa 

P-THLYDGFQNELPSQWLP- 
tatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatgt 

YVYHPRRSYSSRSCVDWL-C 
agagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggatt 

RGLSCN-RCCGY-PTSPARI 
ttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataacac 

FYRC-LSSCTDWLC-H-K-H 
agaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

RIHQS-CKTSTR-PV-TSYT 



3' 5' Frame 1 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattct 

GIRCLNWSPGGGFALTLVNS
gtgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaa 

VLFSVST-PVGTATKLTPVE
aatcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctct 

NPSWRGRLVPTASLVA-QPS 
acatcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacata 

TSKPIHART-RIASSRVINI 
ttagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

LG-PLTW-FILKPIIEMSLR 



3' 5' Frame 2 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctg 

V-DV-TGHLVEVLH-LW-IL 
tgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaa 

CYFQCQHNQSVQLLS-HL-K 
atcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctcta 

ILAGEVG-YPQHL-LHDSPL 
catcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatat 

HQSQSTHERDE-LLRG--TY 
tagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

-GNH-LGNSF-NPS-R-VYG 
3' 5' Frame 3



ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgt 

YKMFKLVTWWRFCINSGEFC 
gttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaa 

VIFSVNITSRYSY-VNTCRK 
tcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctac 

S-LER-VSTHSISSCMTALY 
atcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatatt 

IKANPRTNVTNSFFAGDKHI 
agggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

RVTIDLVIHFETHHRDESTV
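
The six reading frames shown in this figure can be regenerated from the nucleotide sequence alone. The sketch below is illustrative and not taken from the original document: it assumes the standard genetic code, lowercase a/c/g/t input with no ambiguity codes, and the figure's convention of printing stop codons as `-`. The function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
_BASES = "tcag"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def reverse_complement(seq):
    """Reverse complement of a lowercase a/c/g/t sequence."""
    return seq.translate(str.maketrans("acgt", "tgca"))[::-1]


def translate(seq, stop="-"):
    """Translate complete codons only; stop codons are written as `stop`."""
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(stop if CODON_TABLE[c] == "*" else CODON_TABLE[c] for c in codons)


def six_frames(seq):
    """Return the three forward and three reverse-strand translations."""
    seq = seq.lower()
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"5'3' Frame {offset + 1}"] = translate(seq[offset:])
        frames[f"3'5' Frame {offset + 1}"] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    # First 60 nucleotides of the sequence shown in this figure.
    first_row = "taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccct"
    for name, protein in six_frames(first_row).items():
        print(name, protein)
```

Translating a single 60-nucleotide row in isolation drops the trailing partial codon in frames 2 and 3, so those frames come out one residue shorter than the rows printed in the figure, which were produced from the full-length sequence.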



FIGURE 130 

                          10        20        30        40        50        60
                           |         |         |         |         |         |
SEQ ID NO: 9997   KGHDLRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP
SEQ ID NO: 10034      YRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP
                       ******************************************************
Prim.cons.        KGHD2RRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP

                          70        80        90       100       110       120
                           |         |         |         |         |         |
SEQ ID NO: 9997   LQLGFSTGVNLVAVPTGYVDTENNTKFTRVNAQTSTSEQFKHLIPLMYKGLPWNVVRIKI
SEQ ID NO: 10034  LQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI
                  *************************.******. .******
Prim.cons.        LQLGFSTGVNLVAVPTGYVDTENNT2FTRVNA222222QFKHLIPLMYKGLPWNVVRIKI

                         130       140       150       160       170       180
                           |         |         |         |         |         |
SEQ ID NO: 9997   VQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC
SEQ ID NO: 10034
Prim.cons.        VQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC

                         190       200
                           |         |
SEQ ID NO: 9997   WNHSVGFDYVYNPFMIDVQQWGLYG
SEQ ID NO: 10034
FIGURE 131

5' 3' Frame 1 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgaga 

QVHQNVCVL-LIFYLMTLSR 
taataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatg 

--SHKICQ-FQKWSRLQLTM 
ctgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaac 

LKFHSCFGVRMDMLKPSTQN 
tacaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaa 

YKQVKRGNQVLRCLTCTRCK 
gaatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaag 

ECFLKSVTFRIMVKMLLYQK 
gaataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttactt 

E---MSQSILNCVNT-IHLL
tagctgtaccctccaacatgagagttattcactttggtgctgg 

-LYPPT-ELFTLVL



5' 3' Frame 2

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagat 

RFIKMCVFCD-SFT--LCRD 
aataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgc 

NKVTRFVSDFKSGQGYN-LC 
tgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaact 

-NFIHALV-GWTC-NLLPKT 
acaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaag 

TSKSSVATRCCDA-LVQDAK 
aatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagg 

NAS-KV-PSELW-KCCYTKR 
aataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttt 

NNDECRKVYSTVSILKYTYF 
agctgtaccctccaacatgagagttattcactttggtgctgg 

SCTLQHESYSLWCW 



5' 3' Frame 3 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagata 

GSSKCVCSVIDLLLDDFVEI 
ataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgct 

IKSQDLSVISKVVKVTIDYA 
gaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaacta 

EISFMLWCKDGHVETFYPKL 
caagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaaga 

QASQAWQPGVAMPNLYKMQR 
atgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagga
MLLEKCDLQNYGENAVIPKG

ataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttta 
IMMNVAKYTQLCQYLNTLTL 

gctgtaccctccaacatgagagttattcactttggtgctgg 

AVPSNMRVIHFGAG



3' 5' Frame 1 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaag 

PAPK-ITLMLEGTAKVSVFK 
tattgacacagttgagtatactttgcgacattcatcattattccttttggtataacagca 

Y-HS-VYFATFIIIPFGITA 
ttttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaag 

FSP-F-RSHFSRSILCILYK 
ttaggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtt 

LGIATPGCHA-LACSFG-KV 
tcaacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacc 

STCPSLHQSMNEISA-SIVT 
ttgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatca 

LTTFEITDKSCDFIISTKSS
agtaaaagatcaatcacagaacacacacattttgatgaacctg 

SKRSITEHTHFDEP 



3' 5' Frame 2 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagt 

QHQSE-LSCWRVQLK-VYLS 
attgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcat 

IDTVEYTLRHSSLFLLV-QH 
tttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagt 

FHHNSEGHTFQEAFFASCTS 
taggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggttt 

-ASQHLVATLDLLVVLGRRF 
caacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacct 

QHVHPYTKA-MKFQHSQL-P 
tgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaa 

-PLLKSLTNLVTLLSRQSHQ 
gtaaaagatcaatcacagaacacacacattttgatgaacctg 

VKDQSQNTHILMNL 



3' 5' Frame 3 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagta 
STKVNNSHVGGYS-SKCI-V 
ttgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcatt

LTQLSILCDIHHYSFWYNSI 
ttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagtt 

FTIILKVTLFKKHSLHLVQV
aggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtttc 

RHRNTWLPRLTCL-FWVEGF 
aacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacctt 

NMSILTPKHE-NFSIVNCNL 
gaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaag 

DHF-NH-QIL-LYYLDKVIK 
taaaagatcaatcacagaacacacacattttgatgaacctg 

-KINHRTHTF--T 



FIGURE 132 

5' 3' Frame 1

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctct 

-VFTYPGKANQPRSLVDLFS 
aaacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccg 

KRTN-NV--WTPIKPT-CPP 
cattacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatg 

HYIWWTHRFN-Q-PEWRTAM 
gggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttca 

GQGQNSADPKVYPIILRLGS 
cagctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaa 

QLSLSMARRNLDSLEARAFQ 
tcaacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgag 

STPIVVQMTKLATTEELPDE 
ttcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctag 

FVVVTAK-KSSAPDGTSIT- 
gaactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttg 

ELAQKLHFPTALTKKASYGL 
caactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataaca 

QLREP-IHPKTTLAPAILIT 
atgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcag 

MLPPCYNFLKEQHCQKASTQ 
agggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaatt 

REAEAAVKPLLAPHHVVAVI 
caagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggag 

QEIQLLAAVGEILLLEWLAE 
gtggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaag 

VVKLPSRYCC-TD-TSLRAK 
tttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgagg 
FLVKANNNKAKLSLRNLLLR
catctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcat 

HLKSLAKNVLPQNSTTSLKH 
ttgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagac 

LGDVVQNKPKEISGTKT-SD
aaggaactgattacaaacattgggccgcaaattgcacaatttgcct 

KELITNIGPQIAQFA 



5' 3' Frame 2 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctcta 

RFLPTQEKPTNLDLL-ICSL 
aacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgc 

NEQIKMSDNGPQSNQRSAPR 
attacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatgg 

ITFGGPTDSTDNNQNGGLQW 
ggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcac 

GKAKTAPTPRFTQ-YCVLVH 
agctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaat 

SSHSAWQGGT-IPSRPGRSN 
caacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagt 

QHQ-WSR-PNWLLPKSYPTS 
tcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagg 

SWW-RQNERAQPQMVLLLPR 
aactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgc 

NWPRSFTSLRR-QRRHRMGC 
aactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaa 

N-GSLEYTQRPHWHPQS--Q 
tgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcaga 

CCHRATTSSRNNIAKRLLRR
gggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattc 

GKQRRQSSLFSLLIT-SR-F 
aagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggagg 

KKFNSWQQ-GKFSCSNG-RR 
tggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagt 

W-NCPRAIAARQIEPA-EQS 
ttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggc 

FW-RPTTTRPNCH-EICC-G 
atctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcatt 

I-KASPKTYCHKTVQRHSSI 
tgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagaca 

WETWSRTNPRKFRGPRPNQT 
aggaactgattacaaacattgggccgcaaattgcacaatttgcct 

RN-LQTLGRKLHNLP 



5' 3' Frame 3
taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctctaa

GFYLPRKSQPTSISCRSVL- 
acgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgca 

TNKLKCLIMDPNQTNVVPPA 
ttacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatggg 

LHLVDPQIQLTITRMEDCNG 
gcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcaca 

ARPKQRRPQGLPNNIASWFT 
gctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaatc 

ALTQHGKEELRFPRGQGVPI 
aacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagtt 

NTNSGPDDQIGYYRRATRRV 
cgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagga 

RGGDGKMKELSPRWYFYYLG 
actggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgca 

TGPEASLPYGANKEGIVWVA 
actgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaat 

TEGALNTPKDHIGTRNPNNN 
gctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcagag 

AATVLQLPQGTTLPKGFYAE 
ggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattca 

GSRGGSQASSRSSSRSRGNS 
agaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggaggt 

RNSTPGSSRGNSPARMASGG 
ggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagtt 

GETALALLLLDRLNQLESKV 
tctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggca 

SGKGQQQQGQTVTKKSAAEA 
tctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcattt 

SKKPRQKRTATKQYNVTQAF 
gggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagacaa 

GRRGPEQTQGNFGDQDLIRQ
ggaactgattacaaacattgggccgcaaattgcacaatttgcct 

GTDYKHWAANCTIC 



3' 5' Frame 1

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtct 

RQIVQFAAQCL-SVPCLIRS 
tggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacg 

WSPKFPWVCSGPRLPNA-VT 
ttgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagat 

LYCFVAVRFWRGFLDASAAD
ttcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagc 

FLVTVWPCCCWPLPETLLSS 
tggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccatt 
WFNLSSSNSARAVSPPPLAI
cgagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgacta 

RAGEFPLLLPGVEFLELPRL 
cgtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcct 

RDEEREEA-LPPLLPSA-KP 
tttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattg 

FGNVVP-GSCSTVAALLLGL 
cgggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatg 

RVPMWSLGVFKAPSVATHTM 
ccttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaag 

PSLLAP-GSEASGPVPR--K 
taccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagct 

YHLGLSSFILPSPPRTRRVA 
cttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctgg 

LR--PIWSSGPLLVLIGTPW 
cctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgca 

PRGNLSSSLPC-VRAVNQDA 
atattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcc 

ILLGKPWGRRCFGLAPLQSS 
attctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacg 

ILVIVS-ICGSTKCNAGGTT 
ttggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatcta 

LV-LGSIIRHFNLFV-RTDL
caagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

QEIEVGWLFLGR-KP 



3' 5' Frame 2

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtctt 

GKLCNLRPNVCNQFLV-LGL 
ggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgt 

GPRNFLGFVLDHVSQMLE-R 
tgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagatt 

CTVLWQYVFGEAF-MPQQQI 
tcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagct 

S--QFGLVVVGLYQKLCSQA
ggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattc 

GSICLAAIARGQFHHLR-PF 
gagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactac 

EQENFPYCCQELNFLNYRDY 
gtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcctt 

VMRSEKRLDCRLCFPLRRSL 
ttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgc 

LAMLFLEEVVARWQHCY-DC 
gggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgc 

GCQCGLWVYSRLPQLQPIRC 
cttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagt 
LLC-RRREVKLLGQFLGNRS
accatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctc 

TIWG-ALSFCRHHHELVG-L 
ttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggc 

FGSSQFGHLDHYWC-LERPG 
ctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaa 

LEGI-VPPCHAE-EL-TKTQ 
tattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcca 

YYWVNLGVGAVLALPHCSPP 
ttctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgt 

FWLLSVESVGPPNVMRGALR 
tggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctac 

WFDWGPLSDILICSFREQIY 
aagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

KRSRLVGFSWVGKNL 



3' 5' Frame 3

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtcttg 

ANCAICGPMFVISSLSD-VL 
gtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgtt 

VPEISLGLFWTTSPKCLSDV 
gtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagattt 

VLFCGSTFLARLFRCLSSRF 
cttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagctg 

LSDSLALLLLAFTRNFALKL 
gttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattcg 

VQSV-QQ-REGSFTTSASHS 
agcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactacg 

SRRISPTAARS-IS-ITATT 
tgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagccttt 

--GARRGLTAASASLCVEAF
tggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgcg 

WQCCSLRKL-HGGSIVIRIA 
ggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgcc 

GANVVFGCIQGSLSCNPYDA 
ttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagta 

FFVSAVGK-SFWASS-VIEV 
ccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctct 

PSGAELFHFAVTTTNSSGSS 
tcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggcc 

SVVANLVIWTTIGVDWNALA 
tcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaat 

SRESKFLLAMLSESCEPRRN 
attattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctccat 

IIG-TLGSALFWPCPIAVLH 
tctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgtt 
SGYCQLNLWVHQM-CGGHYV
ggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctaca 

GLIGVHYQTF-FVRLENRST 
agagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

RDRGWLAFPG-VKT 
FIGURE 133
FIGURE 134